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Recommended Books and Resources 


There are a large number of books which cover the material in these lectures, although 
often from very different perspectives. They have titles like “Critical Phenomena”, 
“Phase Transitions”, “Renormalisation Group” or, less helpfully, “Advanced Statistical 
Mechanics”. Here are some that I particularly like 


e Nigel Goldenfeld, Phase Transitions and the Renormalization Group 


A great book, covering the basic material that we’ll need and delving deeper in places. 
e Mehran Kardar, Statistical Physics of Fields 


The second of two volumes on statistical mechanics. It cuts a concise path through 
the subject, at the expense of being a little telegraphic in places. It is based on lecture 
notes which you can find on the web; a link is given on the course website. 


e John Cardy, Scaling and Renormalisation in Statistical Physics 


A beautiful little book from one of the masters of conformal field theory. It covers the 
material from a slightly different perspective than these lectures, with more focus on 
renormalisation in real space. 


e Chaikin and Lubensky, Principles of Condensed Matter Physics 
e Shankar, Quantum Field Theory and Condensed Matter 


Both of these are more all-round condensed matter books, but with substantial sections 
on critical phenomena and the renormalisation group. Chaikin and Lubensky is more 
traditional, and packed full of content. Shankar covers modern methods of QFT, with 
an easygoing style suitable for bedtime reading. 


A number of excellent lecture notes are available on the web. Links can be found on 
the course webpage: http://www.damtp.cam.ac.uk/user/tong/sft.html. 
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Conventions 


In a previous course on Statistical Physics, we diligently kept factors of the Boltzmann 
constant, kz, in every equation. Now it is time to grow up. We set kg = 1. This means 
that we measure temperature in units of energy. 


0. Introduction 


Superficially, this course is about phase transitions. This is the name given to the 
abrupt, discontinuous changes that occur when matter is deformed in some way, whether 
through heating or squeezing or something else. The familiar example is the violent 
shaking of a pot on a stove as water approaches its boiling point, and bubbles of steam 
erupt from within. 


Despite their familiarity, phase transitions are striking, and even a little disconcert- 
ing. Usually in physics, things happens gradually. This fact is sewn into the heart 
of classical physics where the positions and momenta of particles are described by 
smooth, differentiable functions. Indeed, historically, the idea that change happens 
only infinitesimally resulted in the discovery of calculus. Yet, somehow, what holds 
on the micro level fails at the macro. Phase transitions tell us that a large number of 
particles can behave collectively in a way that any individual particle cannot, with the 
macroscopic properties of a system changing discontinuously. 


A closer look at what happens at phase transitions — in particular at so-called critical 
points — reveals something startling. Many different substances, regardless of their 
microscopic composition, exhibit identical behaviour at a phase transition. This is not 
just a qualitative statement, but a quantitative one. For example, as a liquid changes 
into a gas at the critical temperature T, the heat capacity diverges as 


1 
is [T — T,|0-11008... 
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The exponent is not known precisely. It is thought not to be a rational number, but 
should instead be viewed as a universal mathematical constant, similar to m or e, but 
more subtle. Remarkably, the same exponent occurs for all gases. It also occurs in other 
systems, including a certain class of magnets. It’s as if all knowledge of the microscopic 
physics has been washed away, leaving us with something pure, that carries only a 
vague memory of what lies underneath. This phenomenon is known as universality. 


All of this makes phase transitions interesting. They involve violence, universal truths 
and competition between rival states. The story of phase transitions is, quite literally, 
the song of fire and ice. 


And yet these are not the only reasons to study phase transitions. In our attempt 
to understand what happens as water boils, we will need to develop new tools and a 
new way of thinking about the world. This leads us to a paradigm which now underlies 
huge swathes of physics, far removed from its humble origin of a pot on a stove. This 


paradigm revolves around two deep facts about the Universe we inhabit: Nature is 
organised by symmetry. And Nature is organised by scale. 


Nature is Organised by Symmetry 


When I was a kid, I was told that there are three phases of matter: solid, liquid and 
gas. (Actually, this isn’t quite true. Knowing that I was interested in this kind of stuff, 
the teacher conspiratorially let on that there was a fourth phase of matter, “plasma”. 
To this day, I have no idea why. My best guess is that this fitted better with some old 
view of the basic elements as earth, water, air and fire.) 


It won’t be any surprise to learn that the real world is much more interesting than the 
one we’re introduced to as kids. There are not three phases of matter, nor four: there 
are many. A key insight, due to Landau, is that these different phases are characterised 
by symmetry. 


In this scheme a solid differs from a liquid because its crystal structure breaks the 
translational and rotational symmetries of space. Moreover, solids with different crystal 
structures should be viewed as different phases of matter because they break these sym- 
metries in different ways. Perhaps more surprisingly, liquids and gases break no such 
symmetries and so should be viewed as the same phase. When you include further sym- 
metries, such as rotations of spins in a magnet or more subtle quantum counterparts, 
this classification opens up a wide range of possibilities that allows us to understand 
almost all the known forms of matter. 


This characterisation has its advantages. First, we can be sure that any attempt to 
change a material from one symmetry class to another will necessarily involve a violent 
phase transition. Second, it turns out that understanding the symmetries of a system 
will immediately determine many of its properties, especially at low temperature. 


Moreover, the classification of matter in terms of symmetry has a power that goes 
far beyond its initial regime of application. The vacuum of space is, in many ways, like 
a complicated material, with quantum effects playing the role of thermal fluctuations. 
The vacuum can sit in different phases and is thought to have undergone several phase 
transitions as the Universe cooled after the big bang, each of which can be understood 
in terms of symmetries. All the ideas that we will develop here carry directly to theories 
of particle physics, cosmology and beyond. 


Nature is Organised by Scale 


There is an order to the Universe we live in. Roughly speaking, little things affect big 
things. Not the other way round. 


This is something you already know: particle physics underlies nuclear and atomic 
physics; atomic physics underlies condensed matter and chemistry; and so on up the 
chain. It’s certainly true that it can be difficult to make the leap from one level to the 
next, and new creative ideas are needed at each step, but this doesn’t change the fact 
that there is an ordering. Big things don’t affect little things. This is the reason there 
are no astrology departments in universities. 


But there is another aspect to this story, one which is not often stressed. Little 
things affect big things, but they rarely affect very big things. Instead, little things 
affect slightly bigger things. And these, in turn, affect slightly bigger things too. But 
as you go up the chain, you lose the information about what came long before. 


This again is something that you know. A zoologist who is interested in the way that 
starlings flock has little reason to study the dynamics of the Higgs boson. It’s also the 
reason that science is possible in the first place: neither Newton nor Einstein needed to 
understand how quantum gravity works on microscopic distance scales to write down 
theories that work extraordinarily well on larger scales. 


In the 1970s a mathematical formalism was developed that makes these ideas con- 
crete. This formalism is called the renormalisation group and provides a framework 
to describe physics at different scales. The renormalisation group gets little coverage 
in popular science articles, yet is arguably the single most important advance in the- 
oretical physics in the past 50 years. While zoologists may have little need to talk to 
particle physicists, the right way to understand both the Higgs boson and the flocking 
of starlings is through the language of the renormalisation group. 


These two ideas — symmetry and scale — now dominate the way we think about 
physics. Yet both have their origins in the simple question: what happens when you 
boil water? The purpose of this course is to find out. 


1. From Spins to Fields 


The ideas that we will introduce in these lectures have wide applicability across many 
areas of physics. However, we will spend much of these lectures studying just a single 
example. This example is known as the [sing model and it provides a particularly simple 
model of a magnet. Despite its simplicity, the Ising model already contains many of 
the concepts that will keep us occupied for the rest of the course. Having understood 
the Ising model in some detail, we see how these ideas play out in many other phases 
of matter in Section 4. 


1.1 The Ising Model 


The Ising model is easy to state, yet hard to solve. We have a lattice in d spatial 
dimensions, containing N sites. On each lattice site i = 1,..., N, there lives a discrete 
variable which can take one of two states: s; = +1 or s; = —1. 


It is useful to refer to these states as spins, with s; = +1 corresponding to spin up, 
and s; = —1 to spin down. However, we won’t be using any machinery from quantum 
mechanics: there are no non-commuting operators, nor quantum superpositions of 
states. In this sense, the Ising model, while discrete, is purely classical. 


The collection of spins {s;} has energy 
B=-BY s= JY sisj (1.1) 
t (ij) 


The first term arises due to an external magnetic field, B that we impose on the system. 
It has the effect that, for B > 0, the spins want to be up because that will lower their 
energy. (Properly this should be the magnetising field H, but we’re using B to avoid 
confusion with the Hamiltonian. There is also a factor of the magnetic moment that 
we’ve absorbed into B.) 


With the first term alone, the individual spins don’t talk to each other and the 
model is easy to solve. It is the second term that makes life more interesting. This is 
an interaction between neighbouring spins. The notation (ij7) means that we sum over 
all “nearest neighbour” pairs in the lattice. The number of such pairs depends both on 
the dimension d and the type of lattice. 


If J > 0, neighbouring spins prefer to be aligned (ff or J44). In the context of 
magnetism, such a system is called a ferromagnet. If J < 0, the spins want to anti- 
align (4). This is an anti-ferromagnet. In the following, we choose J > 0 although, 
for our purposes, the differences are minor. 


(For what it’s worth, the anti-ferromagnetic case, with J < 0, becomes more subtle 
on certain lattices where it’s not possible to arrange the spins so that they are opposite 
to all their neighbours. A 2d triangular lattice provides a simple example. The resulting 
physics is interesting and is known as (geometrical) frustration. We will not discuss it 
here.) 


We are interested in the physics of the Ising model at a finite temperature T. We can 
already get some intuition for what will happen. The interaction energy encourages 
the spins to align in the same way. The magnetic field encourages the spins to align 
in a particular way. Meanwhile, the temperature encourages the spins to ignore both 
the interactions and magnetic field because, at finite temperature, energy is no longer 
at a premium. Instead, entropy becomes more important. Since there are many more 
random configurations than aligned configurations, the temperature will tend to mess 
up the nice ordered states that the interactions and magnetic field have so carefully 
prepared. Already we can see that, like any good story, we’re starting with a healthy 
dose of narrative tension between our main characters. 


In the canonical ensemble, the probability of sitting in a configuration of spins {s;} 
is given by 
e FEIsi] 
Z 


where 8 = 1/T and the normalisation factor Z is called the partition function, and is 
given by 


plsi] = (1.2) 


aT JB) = Se Fes) (1.3) 
{si} 


The beauty of statistical mechanics is that even though Z is first introduced as an 
innocuous normalisation factor, it actually contains everything we want to known about 
the system. If we’re able to perform the sum to compute Z, we can extract any 
information we want!. For example, the interplay between energy and entropy S is 
captured by the thermodynamic free energy 


Finermo(T, B) = (E) — TS = -T log Z (1.4) 


As the notation shows, this is a function of thermodynamic variables, T and B. (We 
don’t usually add the subscript “thermo” to the free energy but for much of these 
lectures we'll be working with a more refined version of the free energy which we'll 
shortly introduce.) 


'The basic machinery of partition functions was described in the first course on Statistical Physics. 


Of particular interest to us will be the average spin of the configuration, which we 
refer to as the equilibrium magnetization. 


1 
m= wid) (1.5) 


This takes values is the range m € [—1,+1]. From our discussion above, we would 
expect that, for B > 0, m — +1 at low temperatures where the spins are ordered, 
and m — 0 at high temperatures where the spins are arranged randomly. We’d like to 
make this intuition more precise. Using the probability (1.2), it is straightforward to 
check that the magnetisation can be written as 
1 eFElsi] 1 dOlogZ 
m= Si = — 1.6 
De E ao 
Taking further derivatives allows us to compute higher moments of the distribution, 


and so learn about fluctuations around the average. We will see this in action later in 
these lectures. 


Our task is now clear: we should compute the sum (1.3). This is easier said than 
done. It turns out that the sum is straightforward in a d = 1 dimensional lattice, and 
you will do this on an example sheet. An exact solution also exists in d = 2 when 
B = 0, originally due to Onsager. It is not straightforward. In higher dimensions, no 
exact solutions are available, although various expansions and tricks have been invented 
to manipulate the sum (1.3) to extract some interesting information. 


In these lectures, we will not attempt to directly sum the microscopic spins in the 
partition function. Instead, we will rephrase the question. We will massage the partition 
function into a somewhat different form, one that has much broader application. 


1.1.1 The Effective Free Energy 


The key idea was explained in the introduction: Nature is organised by scale. The 
energy of the Ising model (1.1) provides the microscopic description of our system in 
terms of individual spins. Computing the partition function exactly gives us a macro- 
scopic description of the system in terms of thermodynamic variables like temperature 
T and magnetic field B. What we’re looking for is something in between. 


We're going to get to this “something in between” in two steps: we’ll do something 
rather crude here, and then do a better job in Section 1.3. For our crude attempt, we 
rewrite the partition function (1.3) in the following manner: 


a 5 > e FFs] .— Ser (1.7) 


m — {si}|m 


where the notation {s;}|m means all configurations of spins such that >>>, s; = m. 
In other words, we first sum over all configurations with fixed average magnetisation 
n= + Š s;, and subsequently sum over all possible m. 


Note that we’re using m here in a subtly different way to the original definition 
(1.5). In the sum (1.7), the magnetisation refers to the average magnetisation of a 
given configuration and can take any value. In contrast, in (1.5) we are talking about 
the equilibrium value of magnetisation, averaged over all configurations in the canonical 
ensemble. We will see shortly how to find this equilibrium value. 


The average magnetisation lies in the range —1 < m < 1. Strictly speaking, this takes 
only discrete values, quantised in units of 2/N. However, we are ultimately interested 
in the limit of large N, so we don’t lose anything by writing this as an integral, 


N +1 
Z= `f dm ePE™ (1.8) 
zl 


where the factor of N/2 is the (inverse) width between the allowed m values. Such 
overall coefficients in the partition function are unimportant for the physics, and we 
will not be careful in keeping them below. 


This way of writing things has allowed us to define something new: an effective free 
energy, F'(m), which depends on the magnetisation m of the system, in addition to 
both T and B. This goes beyond the usual, thermodynamic idea of free energy Finermo 
given in (1.4) which is defined only in equilibrium, where the magnetisation m takes a 
specific value, determined by (1.6). 


Note that (1.8) looks very much like a standard path integral, with F (m) playing the 
role of the energy for m. But there is a difference because, unlike in the microscopic 
theory, F(m) can depend on temperature. This means that the 8 dependence in the 
exponent can be more complicated than we’re used to. We’ll see this explicitly below. 


The effective free energy F'(m) contains more information than the thermodynamic 
free energy Finermo- In particular, F(m) can tell us the correct, equilibrium value of 
the magnetisation m. To see this, we define the free energy per unit spin, 


F(m) 


fim) = -F 


Our partition function becomes 


Z= fom e TPN Fm) 


Here N is a very large number (think N ~ 10%) while 8f(m) ~ 1. Integrals of this kind 
are very well approximated by the value of m which minimises f(m), an approximation 
known as the saddle point or steepest descent, 


Of B 
Im =0 


mM=Mmin 


The minimum Mmin is the equilibrium value of the magnetisation that we previously 
computed in (1.6). Substituting this into the the partition function, we have 


ZZ e BNF (Mmin) => Fihermo X F(Mmin) (1.9) 
In this way, we can reconstruct Finermo from knowledge of F (m). 


1.1.2 Mean Field Theory 


To proceed, we want to find a way to compute F (m) = N f(m), defined in (1.7). But 
this is tantamount to performing the sum in the path integral and, as we stressed above, 
this is not easy. We need to find another way. 


We will use a method called the mean field approximation. Here the word “approx- 
imation” is somewhat generous; a better name would be the mean field “guess” since 
there is little justification for what we’re about to do. Instead, the purpose is to provide 
a starting point for our discussion. Not all the consequences that we derive from this 
guess will be accurate, but as the lectures progress we’ll get a better understanding 
about what we can trust, what we can’t, and how to do better. 


We wish to sum over configurations {s;} with }°,s; = Nm. We can get an esti- 
mate for the energy of such configurations by replacing each spin s in (1.1) with its 
expectation (i.e. mean) value (s) = m, 


E 1 
E=-BY m-JX m => y= Bm sJam’ (1.10) 
i (ij) 


Here q denotes the number of nearest neighbours of each spin. For example, in d = 1 a 
lattice has q = 2; in d = 2, a square lattice has q = 4. A square lattice in d dimensions 
has q = 2d. The factor or 1/2 is there because )-,;,, is a sum over pairs rather than a 
sum of individual sites. 


Among other things, this means that we’re assuming (falsely!) that the energy de- 
pends only on the magnetisation m of a configuration, rather than any more finely 
grained details. The sole reason for doing this is that it makes the resulting sum over 


configurations {s;}|m very easy to do: we simply need to count the number of config- 
urations with magnetisation m. A configuration with N} up spins and N, = N — M; 
down spins has magnetisation 


O M-N, _ 2N,-N 
N N 


The number of such configurations is 


N! 


Gano 
N,I(N — Ny)! 


and we can use Stirlings formula to evaluate 
log Q ~ N log N = N+ log N+ = (N = Ny) log(N = N;) 


log Q 1 1 
Be i log 2 — 5 (1 +m) log(1 +m) — 5( —m) log(1 — m) 


In our mean field approximation, the effective free energy defined in (1.7) is then given 
by 


e PNE(™) x O(m) e PEM 


Substituting the energy (1.10), and taking logs of both sides, we find ourselves with 
the following expression: 


f(m) = —Bm — Jm? -T (1082 — La + m)log(1 +m) — La — m) log(1 — m)) 


From this, we can compute the equilibrium value for m. As explained above, this occurs 
at the minimum 


oF A 


0 => pB(B+Jqm)= 5 oe (1) 
om 
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A little algebra shows us that the equilibrium magnetisation obeys the self-consistency 
condition 


m = tanh (8B + BJqm) (1.11) 


There’s a nice intuition behind this equation. It can be derived by assuming that each 
spin experiences an effective magnetic field given by Beg = B+Jqm, which includes an 
extra contribution from the spins around it. In this way, the tricky interactions in the 
Ising model have been replaced by an averaged effective field Beg. This is sometimes 
called the mean field and gives its name to this technique. 


= ]0 = 


There are a number of ways forward at this point. We could, for example, analyse the 
properties of the Ising model by looking at solutions to the self-consistency condition 
(1.11); this was the strategy taken in the Statistical Physics lectures. However, instead 
we’re going to focus on the free energy f(m), since this will prove to be the more useful 
tool moving forward. 


1.2 Landau Approach to Phase Transitions 


A phase transition occurs when some quantity changes in a discontinuous fashion. For 
us, this quantity is m which, as we will see, will exhibit non-analytic behaviour as we 
vary different parameters. 


Landau theory is a simple, yet effective way to understand the qualitative manner 
in which these phase transitions occur. It is based on two ideas: the free energy, and 
symmetry. Here we will apply Landau’s theory to the Ising model, focussing first on 
the free energy. We will comment on the role of symmetry at the end of this section, 
although the full power of this idea will only become apparent in Section 4 where we 
describe a whole raft of further examples. 


In the previous section, we introduced the effective free energy F(m) = Nf(m), 
which is a function of the magnetisation m. This kind of object is the starting point for 
Landau’s theory. In this context, the magnetisation is known as an order parameter. 
Using the mean field approximation, we computed this to be 


f(m) = —Bm — sda? -T (1082 — Ti +m) log(1 +m) — (1 —m) log(1 — m)) 


Landau’s approach focuses on situations where the order parameter is small. Here we 
can Taylor expand, 


1 1 
f(m) = —Tlog2 — Bm + 5(T — Jq)m* + —Tm* +... (1.12) 


As we’ve seen above, the preferred value of m is simply the minimum of f(m). The 
idea of Landau theory is simply to observe how the function f(m), and in particular its 
minima, change as we vary different parameters. Crucially, the story is rather different 
depending on whether B # 0 or B = 0. We will deal with each of these in turn. 


1.2.1 B=0: A Continuous Phase Transitions 


Let’s first consider the situation with vanishing magnetic field, B = 0, so that there is 
no term in (1.12) that is linear in m. Since our interest lies in the m dependence of 


=] = 


f(m) f(m) 
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Figure 1: Free energy when T > Te Figure 2: Free energy when T < Te 


f(m), we won’t lose anything if we drop the constant —T log 2 term. We’re left with 
the free energy 


f(m) = (Tf — Jq)m + arm +... (1.13) 


NI =| 


The behaviour of the free energy depends on whether the quadratic term is positive or 
negative. To distinguish between these two, we define the critical temperature 


The free energy is sketched in the case where T > T, (on the left) and T < T, (on the 
right). We see that at high temperatures, the magnetisation vanishes at the minimum: 
m = 0. This is in agreement with our expectation that temperature will randomise the 
spins. However, as the temperature is reduced below Te, the point m = 0 becomes a 
maximum of the free energy and the minima now lie at m = +mo which, if we chose 
to truncate the free energy (1.13) at order m4, is given by 


aT) 
T 


(1.14) 


Mo = 
This form is valid only when T is close to T, so that A 
m is small and higher order terms can be ignored. As 


+1 
the temperature is lowered further, the minimum mo 


grows. We’re then obliged to turn to the full form > 
of the free energy f(m) which, among other things, 
knows that the magnetisation lies in the range m € 
[—1, +1]. 


Figure 3: 
The upshot is that as we vary the temperature, the 


magnetisation takes the form shown on the right. This is perhaps somewhat surprising. 
Usually in physics, things turn on gradually. But here the magnetisation turns off 


aya 


abruptly at T = Tą, and remains zero for all higher temperatures. This kind of sharp 
change is characteristic of a phase transition. When m = 0, we say that the system 
sits in the disordered phase; when m # 0, it is in the ordered phase. 


The magnetisation itself is continuous and, for this reason, it is referred to as a 
continuous phase transition or, sometimes, a second order phase transition. 


As an aside: phase transitions can be classified by looking at the thermodynamic free 
energy Finermo = N f (Mmin) and taking derivatives with respect to some thermodynamic 
variable, like T. If the discontinuity first shows up in the n™ derivative, it is said to be 
an n*™ order phase transition. However, in practice we very rarely have to deal with 
anything other than first order transitions (which we will see below) and second order 
transitions. In the present case, we’ll see shortly that the heat capacity is discontinuous, 
confirming that it is indeed a second order transition. 


The essence of a phase transition is that some quantity is discontinuous. Yet, this 
should make us nervous. In principle, everything is determined by the partition function 
Z, defined in (1.3), which is a sum of smooth, analytic functions. How is it possible, 
then, to get the kind of non-analytic behaviour characteristic of a phase transition? 
The loophole is that Z is only necessarily analytic if the sum is finite. But there is no 
such guarantee that it remains analytic when the number of lattice sites N — oo. This 
means that genuine, discontinuous phase transitions only occur in infinite systems. 
In reality, we have around N ~ 107° atoms. This gives rise to functions which are, 
strictly speaking, smooth, but which change so rapidly that they act, to all intents and 
purposes, as if they were discontinuous. 


These kind of subtleties are brushed under the carpet in the mean field approach 
that we’re taking here. However, it’s worth pointing out that the free energy, f(m) is 
an analytic function which, when Taylor expanded, gives terms with integer powers of 
m. Nonetheless, the minima of f behave in a non-analytic fashion. 


For future purposes, it will be useful to see how the heat capacity C = O(E)/0T 
changes as we approach T}. In the canonical ensemble, the average energy is given by 
(E) = —Olog Z/08. From this, we find that we can write the heat capacity as 


oe io Z (1.15) 


To proceed, we need to compute the partition function Z, by evaluating f(m) at the 
minimum value Min as in (1.9). When T > T, this is simple: we have Mmin = 0 and 
f(Mmin) = 0 (still neglecting the constant T log 2 term which doesn’t contribute to the 


—13- 


heat capacity). In contrast, when T < T, the minimum lies at m = mp given in (1.14), 
and the free energy is f(mo) = —3(T, — T)?/T. 


Now we simply need to differentiate to get the heat ca- a 
pacity, C. The leading contribution as T —> T, comes from | 
differentiating the (T. — T) piece, rather than the 1/T piece. Se 
We have | m 


0 as T > T+ 
(1.16) piesa 
3/2 asT >T; a aaa 


We learn that the heat capacity jumps discontinuously. The part of the heat capacity 
that comes from differentiating (T — T.) terms is often called the singular piece. We’ll 
be seeing more of this down the line. 


Spontaneous Symmetry Breaking 


There is one further aspect of the continuous phase transition that is worth highlighting. 
The free energy (1.13) is invariant under the Zə symmetry m — —m. This is no 
coincidence: it follows because our microscopic definition of the Ising model (1.1) also 
enjoys this symmetry when B = 0. 


However, below T}, the system must pick one of the two ground states m = +mo or 
m = —mo. Whichever choice it makes breaks the Zə symmetry. When a symmetry of a 
system is not respected by the ground state we say that the symmetry is spontaneously 
broken. This will become an important theme for us as we move through the course. 
It is also an idea which plays an important role in particle physics . 


Strictly speaking, spontaneous symmetry breaking can only occur in infinite systems, 
with the magnetisation defined by taking the limit 
1 
m= lim lim — 5 (s) 


B30 N=>œ N 


a 


It’s important that we take the limit N — oo, before we take the limit B — 0. If we 
do it the other way round, we find }°(s;) > 0 as B —> 0 for any finite N. 


1.2.2 B #0: First Order Phase Transitions 


Let’s now study what happens when B # 0. Once again, we ignore the constant 
—T log 2 term in the free energy (1.12). We’re left with the free energy 


1 1 
f(m) = —Bm + 5(T — Jq)m* + Tm +... (1.17) 
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f(m) f(m) 


m 


Figure 5: Free energy at high tempera- Figure 6: Free energy at low tempera- 
tures tures. 


There are again two, qualitatively different forms of this function at low and high 
temperatures, shown in the figures above for B > 0. 


At low temperatures there are two minima, but one is always lower than the other. 
The global minima is the true ground state of the system. The other minima is a 
meta-stable state. The system can exit the meta-stable state by fluctuating up, and 
over the energy barrier separating it from the ground state, and so has a finite lifetime. 
As we increase the temperature, there is a temperature (which depends on B) beyond 
which the meta-stable state disappears. This temperature is referred to as the spinodal 
point. It will not play any further role in these lectures. 


For us, the most important issue is that the ground state of the system — the global 
minimum of f(m) — does not qualitatively change as we vary the temperature. At high 
temperatures, the magnetisation asymptotes smoothly to zero as 


Ain 
+1 
B B>0 
m > — as T > co T 

T 
At low temperatures, the magnetization again asymptotes ~! es 
to the state m — +1 which minimises the energy. Except 
this time, there is no ambiguity as to whether the system Figure 7: 


chooses m = +1 or m = —1. This is entirely determined 

by the sign of the magnetic field B. A sketch of the magnetisation as a function of 
temperature is shown on the right. The upshot is that, for B 4 0, there is no phase 
transition as a function of the temperature. 


However, we do not have to look to hard to find a phase transition: we just need to 
move along a different path in the phase diagram. Suppose that we keep T fixed at a 
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Figure 8: T < Te: The free energy of the Ising model for B < 0 on the left, B = 0 in the 
middle, and B > 0 on the right. 


value below T}. We then vary the magnetic field from B < 0 to B > 0. The resulting 
free energy is shown in Figure 8. 


We see that the magnetisation jumps discontinuously from —mp to +mpo as B flips 
from negative to positive. This is an example of a first order phase transition. 


Our analysis above has left us with the following picture B 
of the phase diagram for the Ising model: if we vary B from 
positive to negative then we cross the red line in the figure and T 
the system suffers a first order phase transition. Note, however, Te 


that if we first raise the temperature then it’s always possible to 


move from any point B Æ 0 to any other point without suffering 
a phase transition. Figure 9: 


This line of first order phase transitions ends at a second order phase transition at 
T =T.. This is referred to as the critical point. 


Close to the Critical Point 


It will prove interesting to explore what happens when we sit close to the critical tem- 
perature T = T}. There are a bunch of different questions that we can ask. Suppose, for 
example, that we sit at T = T, and vary the magnetic field: how does the magnetisation 
change? Here the free energy (1.17) becomes 
1 
f(m) = -Bm + qo rem" +... 


For m small, where we can neglect the higher order terms, minimising the free energy 
gives m? ~ B. So, when B > 0 we have 


m ~ B!’ (1.18) 


and when B < 0 we have m ~ —|B|!/3. 
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Here is another question: the magnetic susceptibility x is defined as 


_ Om 


= 5B, (1.19) 


X 
We will compute this at B = 0, and as T — T, from both above and below. First, 
from above: when T = T, we can keep just the linear and quadratic terms in the free 
energy 


B i 
T-T. A TaT. 


1 
f(m) ~ -Bm + 5(T — T.)m’ > mes 


When T < T,, we need to work a little harder. For small B, we can write the minimum 
of (1.17) as m = mo + ôm where mo is given by (1.14). Working to leading order in 
B/T, we find 


mx my t -n > yy 


= 7) Lt 


We can combine these two results by writing 


1 


aS E 1.20 
|T ~ Tol l ) 


X 


We’ll see the relevance of this shortly. 


1.2.3 Validity of Mean Field Theory 


The first thing that we should ask ourselves is: are the results above right?! We have 
reason to be nervous because they were all derived using the mean field approximation, 
for which we offered no justification. On the other hand, there is reason to be optimistic 
because, at the end of the day, the structure of the phase diagram followed from some 
fairly basic properties of the Taylor expansion of the free energy. 


In this section and the next, we will give some spoilers. What follows is a list of 
facts. In large part, the rest of these lectures will be devoted to explaining where these 
facts come from. It turns out that the validity of mean field theory depends strongly 
on the spatial dimension d of the theory. We will explain this in detail shortly but here 
is the take-home message: 


e In d= 1 mean field theory fails completely. There are no phase transitions. 


e In d = 2 and d = 3 the basic structure of the phase diagram is correct, but 
detailed predictions at T ~ T, are wrong. 


= i= 


e Ind > 4, mean field theory gives the right answers. 


This basic pattern holds for all other models that we will look at too. Mean field theory 
always fails completely for d < dı, where dı known as the lower critical dimension. For 
the Ising model, d; = 1, but we will later meet examples where this approach fails in 
d= 2, 


In contrast, mean field theory always works for d > de, where de is known as the 
upper critical dimension. For the Ising model, mean field theory works because, as 
d increases, each spin has a larger number of neighbours and so indeed experiences 
something close to the average spin. 


Critical Exponents 


What about the intermediate dimensions, dj < d < de? These are very often the 
dimensions of interest: for the Ising model it is d = 2 and d = 3. Here the crude 
structure of the phase diagram predicted by mean field theory is correct, but it gives 
misleading results near the critical point T = T}. 


To explain this, recall that we computed the behaviour of four different quantities 
as we approach the critical point. For three of these, we fixed B = 0 and dialled the 
temperature towards the critical point. We found that, for T < T., the magnetisation 
(1.14) varies as 


l 
m ~ (T-T)? with B= 5 (1.21) 
The heat capacity (1.16) varies as 


e~es|[—-T,|"* witha =0 (1.22) 


where the c+ is there to remind us that there is a discontinuity as we approach T, from 


above or below. The magnetic susceptibility (1.20) varies as 


1 


X 


The fourth quantity requires us to take a different path in the phase diagram. This time 
we fix T and dial the magnetic field B towards zero, in which case the magnetisation 
(1.18) varies as 


m~ B? withô=3 (1.23) 


The coefficients a, 6, y and 6 are known as critical exponents. (The Greek letters are 
standard notation; more generally, one can define a whole slew of these kind of objects.) 
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Figure 10: Magnetisation m, plotted Figure 11: Heat capacity c ~ |T —T.|~° 
against temperature for an Ising magnet, for various gases shows good agreement 
suggests that 1/8 ~ 3. with a ~ 1/8. 


When the Ising model is treated correctly, one finds that these quantities do indeed 
scale to zero or infinity near T}, but with different exponents. Here is a table of our 
mean field (MF) values, together with the true results for d = 2 and d = 3, 


MF d=2 d=3 


a | 0 (disc.) 0 (log) 0.1101 
b 5 5 0.3264 
y 1 zo aL 2371 
ô 3 15 4.7898 


Note that the heat capacity has critical exponent œ = 0 in both mean field and in 
d = 2, but the discontinuity seen in mean field is replaced by a log divergence in d = 2. 


The d = 2 results are known analytically, while the d = 3 results are known only 
numerically (to about 5 or 6 significant figures; I truncated early in the table above). 
Both the d = 2 and d = 3 results are also in fairly good agreement with experiment, 
which confirm that the observed exponents do not take their mean field values. For 
example, the left-hand figure above shows the magnetisation m!/° ~ (T — T.) taken 
from MnF», a magnet with uniaxial anisotropy which is thought to be described by the 
Ising model’. The data shows a good fit to 1/8 = 3; as shown in the table, it is now 
thought that the exponent is not a rational number, but 1/8 œ~ 3.064. 


?This data is taken from P. Heller and G. Benedek, Nuclear Magnetic Resonance in MnF, Near 
the Critical Point, Phys. Rev. Lett. 8, 428 (1962). 
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This kind of behaviour is very surprising. It’s rare that we see any kind of non- 
analytic behaviour in physics, but rarer still to find exponents that are not integers or 
simple fractions. What is going on here? This is one of the questions we will answer 
as these lectures progress. 


1.2.4 A First Look at Universality 


Before we go digging further into the Ising model, p critical point 
there is one other important aspect that deserves a men- 
tion at this point. The phase diagram for the Ising model 
is rather similar to the phase diagram for the transition 
between a liquid and gas, now drawn in the pressure- 


temperature plane. This is shown in the figure. In both 


cases, there is a line of first order transitions, ending at 


To 


a critical point. For water, the critical point lies at a 
temperature T, ~ 374 °C and a pressure pe © 218 atm. Figure 12: 


The similarities are not just qualitative. One can use an appropriate equation of 
state for an interacting gas — say, the van der Waals equation — to compute how various 
quantities behave as we approach the critical point. (See the lectures on Statistical 
Physics for details of these calculations.) As we cross the first order phase transition, 
keeping the pressure constant, the volume V of the liquid/gas jumps discontinuously. 
This suggests that the rescaled volume v = V/N, where N is the number of atoms in 
the gas, is analogous to m in the Ising model. We can then ask how the jump in v 
changes as we approach the critical point. One finds, 


1 
Ugas — Vliquid ~ (T; = T)’ where B = z 


From the phase diagram, we see that the pressure p is analogous to the magnetic field 
B. We could then ask how the volume changes with pressure as we approach the critical 
point keeping T = T, fixed. We find 


Ugas — Vliquid ™ (p = pe)" where 6 = 3 


Finally, we want the analog of the magnetic susceptibility. For a gas, this is the 


compressibility, « = — t 2 


z ae As we approach the critical point, we find 


1 


"nT 


where y = 1 


It has probably not escaped your attention that these critical exponents are exactly 
the same as we saw for the Ising model when treated in mean field. The same is also 
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true for the heat capacity Cy which approach different constant values as the critical 
point is approached from opposite sides. 


However, the coincidence doesn’t stop there. Because, it turns out, that the critical 
exponents above are also wrong! The true critical exponents for the liquid-gas transi- 
tions in d = 2 and d = 3 dimensions are the same as those of the Ising model, listed 
previously in the table. For example, experimental data for the critical exponent a of a 
number of different gases was plotted two pages back? showing that it is approximately 
a x 0.1, and certainly not consistent with mean field expectations. 


This is an astonishing fact. It’s telling us that at a second order phase transition, all 
memory of the underlying microscopic physics is washed away. Instead, there is a single 
theory which describes the physics at the critical point of the liquid-gas transition, the 
Ising model, and many other systems. This is a theoretical physicist’s dream! We 
spend a great deal of time trying to throw away the messy details of a system to focus 
on the elegant essentials. But, at a critical point, Nature does this for us. Whatever 
drastic “spherical cow” approximation you make doesn’t matter: if you capture the 
correct physics, you will get the exact answer! The fact that many different systems 
are described by the same critical point is called universality. 


We might ask: does every second order phase transition have the same critical ex- 
ponents as the Ising model? The answer is no! Instead, in each dimension d there is 
a set of critical points. Any system that undergoes a second order phase transition is 
governed by one member of this set. If two systems are governed by the same critical 
point, we say that they lie in the same universality class. The choice of universality 
class is, in large part, dictated by the symmetries of the problem; we will see some 
examples in Section 4. 


The Ising Model as a Lattice Gas 


It is, at first sight, surprising that a magnet and gas lie in the same universality class. 
However, there is a different interpretation of the Ising model that makes it look a little 
more gassy. 


3The data is taken from J. Lipa, C. Edwards, and M. Buckingham Precision Measurement of the 
Specific Heat of CO2 Near the Critical Point’, Phys. Rev. Lett. 25, 1086 (1970). This was before the 
theory of critical phenomena was well understood. The data shows a good fit with a ~ 1/8, not too 
far from the now accepted value a ~ 0.11. Notice that the data stops around t = 1 — T/T. ~ 1074. 
This is apparently because the effect of gravity becomes important as the critical point is approached, 
making experiments increasingly difficult. 


=e 


To see this, consider the same d-dimensional lattice as before, but now with particles 
hopping between lattice sites. These particles have hard cores, so no more than one 
can sit on a single lattice site. We introduce the variable n; € {0,1} to specify whether 
a given lattice site, labelled by i, is empty (n; = 0) or filled (n; = 1). We can also 
introduce an attractive force between atoms by offering them an energetic reward if 
they sit on neighbouring sites. The Hamiltonian of such a lattice gas is given by 


E= —4J X ninj -uX ni 
(ij) i 


where u is the chemical potential which determines the overall particle number. But this 
Hamiltonian is trivially the same as the Ising model (1.1) if we make the identification 


si = 2n; — 1 € {-1,1} 


The chemical potenial u in the lattice gas plays the role of magnetic field in the spin 
system while the magnetization of the system (1.6) measures the average density of 
particles away from half-filling. 


There’s no a priori reason to think that the Ising model is a particular good descrip- 
tion of a gas. Nonetheless, this interpretation may make it a little less surprising that 
the Ising model and a gas share the same critical point. 


1.3 Landau-Ginzburg Theory 


The idea of universality — that many different systems enjoy the same critical point — 
is a powerful one. It means that if we want to accurately describe the critical point, 
we needn’t concern ourselves with the messy details of any specific system. Instead, 
we should just search for the simplest model which gives the correct physics and work 
with that. 


What is the simplest model? The Landau approach — in which the configuration of 
the Ising model is reduced to a single number m — is too coarse. This is because it misses 
any spatial variation in the system. And, as we will see shortly, the critical point is 
all about spatial variations. Here we describe a simple generalisation of Landau’s ideas 
which allows the system to move away from a homogeneous state. This generalisation 
is known as Landau-Ginzburg theory. 


4It is also known as Ginzburg-Landau theory. The original paper, dating from 1950, has the 
authors in alphabetical order and constructs the free energy of a superconductor. Here we use the 
term Landau-Ginzburg theory to reflect the fact that, following Landau’s earlier work, these ideas 
apply more broadly to any system. 
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The key idea is simple: we take the order parameter from Landau theory — which, 
for the Ising model, is the magnetisation m — and promote it to a field which can vary 
in space, m(x). This is referred to as a local order parameter. 


How do we do this? We started with a lattice. We divide 
this lattice up many into boxes, with sides of length a. Each 
of these boxes contains many lattice sites — call it N’ — but the 
box size a is smaller than any other length scale in the game. 
(In particular, a should be smaller than something called the 
correlation length that we will encounter below.) For each box, 
we define the average magnetisation m(x) = 37 >); si, where 
x denotes the centre of the box. This kind of procedure in 


known as coarse-graining. 


Figure 13: 


There are all sorts of subtleties involved in this construction, 

and we will do our best to sweep them firmly under the rug. First, m(x) takes values 
only at certain values of x which label the positions of boxes. To mitigate this, we take 
the number of boxes N/N’ to be big enough so that x can be treated as a continuous 
variable. Second, at each x, m(x) is quantised in units of 1/N’. We will take N’ big 
enough so that m(x) can take any value in [—1,+1]. The upshot of this is that we will 
treat m(x) as a continuous function. However, we will at some point need to remember 
that it makes no sense for m(x) to vary on very short distance scales — shorter than 
the separation between boxes. 


You may have noticed that we’ve not been particularly careful about defining m(x) 
and this cavalier attitude will continue below. You might, reasonably ask: does the 
physics depend on the details of how we coarse grain? The answer is no. This is the 
beauty of universality. We’ll see this more clearly as we proceed. 


Our next step is to repeat the ideas that we saw in Section 1.1.1. Following (1.7), 
we write the partition function as 


o> o-BEIsi] «Sr @-BFim()] (1.24) 
m(x) {si}|m(x) m(x) 

Here, the notation {s;}|m(x) means that we sum over all configurations of spins such 

that the coarse graining yields m(x). In general, there will be many spin configurations 

for each m(x). We then sum over all possible values of m(x). 


This procedure has allowed us to define a free energy F'[m(x)]. This is a functional, 
meaning that you give it a function m(x) and it spits back a number, F[m(x)]. This 
is known as the Landau-Ginzburg free energy 
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We will invoke one last notational flourish. We’re left in (1.24) with a sum over all 
possible configurations m(x). With the assumption that m(x) is a continuous function, 
this is usually written as 


Z= f Dms) e Palme (1.25) 


This is a functional integral, also known as a path integral. The notation Dm(x) — 
which is usually shortened to simply Dm — means that we should sum over all field 
configurations m(x). 


Path integrals may be somewhat daunting at first sight. But it’s worth remembering 
where it comes from: an integration over m(x) at each point x labelling a box. In other 
words, it’s nothing more than an infinite number of normal integrals. We will start to 
learn how to play with these objects in Section 2. 


The path integral looks very much like a usual partition function, but with the 
Landau-Ginzburg free energy F'|m(x)] playing the role of an effective Hamiltonian for 
the continuous variable m(x). There is some nice intuition behind this. In the thermal 
ensemble, a given field configuration m(x) arises with probability 


e-BFIm(x)] 


a (1.26) 


plm(x)] = 
The path integral (1.25), is nothing but the usual partition function, now for a field 
m(x) rather than the more familiar variables of classical physics like position and 
momentum. In other words, we’re doing the statistical mechanics of fields, rather than 
particles. The clue was in the title of these lectures. 


1.3.1 The Landau-Ginzburg Free Energy 


The next step is to ask: how do we calculate F'|m(x)]? This seems tricky: already in 
our earlier discussion of Landau theory, we had to resort to an unjustified mean field 
approximation. What are we going to do here? 


The answer to this question is wonderfully simple, as becomes clear if we express the 
question in a slightly different way: what could the free energy F|m(x)] possibly be? 
There are a number of constraints on F'|m(x)| that arise from its microscopic origin 


e Locality: The nearest neighbour interactions of the Ising model mean that a spin 
on one site does not directly affect a spin on a far flung site. It only does so 
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through the intermediate spins. The same should be true of the magnetisation 
field m(x). The result is that the free energy should take the form 


F[m(x)] = J d's f[m(x)] 


where f[m(x)] is a local function. It can depend on m(x), but also on Vm(x) 
and higher derivatives. These gradient terms control how the field m(x) at one 
point affects the field at neighbouring points. 


e Translation and Rotation Invariance: The original lattice has a discrete transla- 
tion symmetry. For certain lattices (e.g. a square lattice) there can be discrete 
rotation symmetries. At distances much larger than the lattice scale, we expect 
that the continuum version of both these symmetries emerges, and our free energy 
should be invariant under them. 


e Z symmetry. When B = 0, the original Ising model (1.1) is invariant under 
the symmetry s; + —s;, acting simultaneously on all sites. This symmetry is 
inherited in our coarse-grained description which should be invariant under 


m(x) > —m(x) (1.27) 


When B ¥ 0, the Ising model is invariant under m(x) + —m(x), together with 
B — —B. Again, our free energy should inherit this symmetry. 


e Analyticity: We will make one last assumption: that the free energy density is 
an analytic function of m(x) and its derivatives. Our primary interest lies in 
the critical point where m first becomes non-zero, although we will also use this 
formalism to describe first order phase transitions where m(x) is small. In both 
cases, we can Taylor expand the free energy in m and restrict attention to low 
powers of m. 


Furthermore, we will restrict attention to situations where m(x) varies rather 
slowly in space. In particular, we will assume that m(x) varies appreciably only 
over distances that are much larger than the distance a between boxes. This 
means that we can also consider a gradient expansion of f[m(x)], in the dimen- 
sionless combination aV. This means that Vm terms are more important than 
aV?m terms and so on. 


With these considerations in place, we can now simply write down the general form 
of the free energy. When B = 0, the symmetry (1.27) means that the free energy can 
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depend only on even powers of m. The first few terms in the expansion are 


1 1 1 
Fim(x)| = i Ean + poa(T)m* + x V(P (Vim) +... (1.28) 
There can also be a Fo(T) piece — like the T log 2 that appeared in the Landau free 
energy — which doesn’t depend on the order parameter and so, for the most part, will 
play no role in the story. Notice that we start with terms quadratic in the gradient: a 
term linear in the gradient would violate the rotational symmetry of the system. 


When B ¥ 0, we can can have further terms in the free energy that are odd in m, 
but also odd in B, such as Bm and Bm. Each of these comes with a coefficient which 
is, in general, a function of T. 


The arguments that led us to (1.28) are very general and powerful; we will see many 
similar arguments in Section 4. The downside is that we are left with a bunch of 
unknown coefficients a2(T), a4(T) and 7(T). These are typically hard to compute 
from first principles. One way forward is to import our results from Landau mean field 
approach. Indeed, for constant m(x) = m, the free energy (1.28) coincides with our 
earlier result (1.13) in Landau theory, with 


ao(T)~(T-T.) and ag(T)~ z (1.29) 


Happily, however, the exact form of these functions will not be important. All we will 
need is that these functions are analytic in T, and that a4(T) > 0 and q(T) > 0, while 
Q(T’) flips sign at the second order phase transition. 


Looking ahead, there is both good news and bad. The good news is that the path 
integral (1.25), with Landau-Ginzburg free energy (1.28), does give a correct description 
of the critical point, with the surprising d-dependent critical exponents described in 
Section 1.2.3. The bad news is that this path integral is hard to do! Here “hard” 
means that many of the unsolved problems in theoretical physics can be phrased in 
terms of these kinds of path integrals. Do not fear. We will tread lightly. 


1.3.2 The Saddle Point and Domain Walls 


We are going to build up slowly to understand how we can perform the functional 
integral (1.25). As a first guess, we’ll resort to the saddle point method and assume 
that the path integral is dominated by the configurations which minimise F’'|m(x)]. In 
subsequent sections, we'll treat the integral more seriously and do a better job. 
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To find the minima of functionals like F'|m(x)], we use the same kind of variational 
methods that we met when working with Lagrangians in Classical Dynamics . We take 
some fixed configuration m(x) and consider a nearby configuration m(x) + 6m(x). The 
change in the free energy is then 


jF = jes laom ôm + azm? 6m + vyVm - Vom| 
= jes [azm + azm? — yV?m] ôm 


where, to go from the first line to the second, we have integrated by parts. This 
encourages us to introduce the functional derivative, 


OF 


Fala) T 02720) + aam? (xe) — WV? (2x) 


Note that I’ve put back the x dependence to stress that, in contrast to F, 6F'/dm(x) 
is evaluated at some specific position x. 


If the original field configuration m(x) was a minimum of the free energy it satisfies 
the Euler-Lagrange equations, 


OF 


— =0 > yV?m = am + asm? (1.30) 
ôm 


m(x) 


The simplest solutions to this equation have m constant. This recovers our earlier 
results from Landau theory. When T > T., we have a2 > 0 and the ground state has 
m = 0. In contrast, when T < Te, a2 < 0 and there is a degenerate ground state 
m = +mpo with 


ERE a (1.31) 


This is the same as our previous expression (1.14), where we replaced a2 and a, with 
the specific functions (1.29). We see that what we previously called the mean field 
approximation, is simply the saddle point approximation in Landau-Ginzburg theory. 
For this reason, the term “mean field theory” is given to any situation where we write 
down the free energy, typically focussing on a Taylor expansion around m = 0, and 
then work with the resulting Euler-Lagrange equations. 


Domain Walls 


The Landau-Ginzburg theory contains more information than our earlier Landau ap- 
proach. It also tells us how the magnetisation changes in space. 
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Suppose that we have T < T, so there exist two degenerate ground states, m = +mo. 
We could cook up a situation in which one half of space, say x < 0, lives in the ground 
state m = —mp while the other half of space, x > 0 lives in m = +mpo. The two regions 
in which the spins point up or down are called domains. The place where these regions 
meet is called the domain wall. 


We would like to understand the structure of the domain wall. How does the system 
interpolate between these two states? The transition can’t happen instantaneously 
because that would result in the gradient term (Vm)? giving an infinite contribution 
to the free energy. But neither can the transition linger too much because any point at 


2 


which m(x) differs significantly from the value mg costs free energy from the m* and 


mt terms. There must be a happy medium between these two. 


To describe the system with two domains, m(x) must vary but it need only change 
in one direction: m = m(x). Equation (1.30) then becomes an ordinary differential 
equation, 

d'm F 
Tie = QM + ym 
This equation is easily solved. If we insist that the field interpolate between the two 


different ground states, m — FMmo as x — Foo, then the solution is given by 
r—xX 
m = mo tanh | ——— 1:32 
stanh (27% ) (1.22) 


This is plotted in the figure. Here X is the position of the ma 
domain wall and 


m 


is its width. For |z — X| > W, the magnetisation relaxes 


exponentially quickly back to the ground state values +mo. Figure 14: 


We can also compute the cost in free energy due to the presence of the domain wall. 
To do this, we substitute the solution back into the expression for the free energy (1.28). 
The cost is not proportional to the volume of the system, but instead proportional to 
the area of the domain wall. This means that if the system has linear size L then the 
free energy of the ground state scales as L4 while the additional free energy required 
by the wall scales only as L¢~!. It is simple to see that the excess cost in free energy 
of the domain wall has parametric dependence 


3 
d-1 | YQ 
Fran ~ LY "4 | a (1.33) 
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Here’s the quick way to see this: from the solution (1.32), the field m changes by 
Am ~ mo over a distance Ax ~ W. This means that dm/dx ~ Am/Az ~ mo/W 
and the gradient term in the free energy contributes F ~ L! f dx y(dm/dzx)? ~ 
L*1W-7(mo/W)?, where the second expression comes because the support of the inte- 
gral is only non-zero over the width W where the domain wall is changing. This gives 


2 and 


the parametric dependence (1.33). There are further contributions from the m 
mt terms in the potential, but our domain wall solves the equation of motion whose 
purpose is to balance the gradient and potential terms. This means that the potential 
terms contribute with the same parametric dependence as the gradient terms, a fact 


you can check by hand by plugging in the solution. 


Notice that as we approach the critical point, and a2 — 0, the two vacua are closer, 
the width of the domain wall increases and its energy decreases. 


1.3.3 The Lower Critical Dimension 


We stated above that the Ising model has no phase transition in d = 1 dimensions. 
The reason behind this can be traced to the existence of domain walls, which destroy 
any attempt to sit in the ordered phase. 


Let’s set a2(T) < 0 where we would expect two, ordered ground states m = +mpo. 
To set up the problem, we start by considering the system on a finite interval of length 
L. At one end of this interval — say the left-hand edge x = —L/2 — we'll fix the 
magnetisation to be in its ground state m = +mp. One might think that the preferred 
state of the system is then to remain in m = +mpo everywhere. But what actually 
happens? 


There is always a probability that a domain wall will appear in the thermal ensemble 
and push us over to the other ground state m = —mp. The probability for a wall to 
appear at some point x = X is given by (1.26) 

e 7 PF wal 

Z 
This looks like it’s exponentially suppressed compared to the probability of staying put. 
However, we get an enhancement because the domain wall can sit anywhere on the line 
—L/2 < x < L/2. This means that the probability becomes 
e PFwa J, 

Z W 
For a large enough system, the factor of L/W will overwhelm any exponential suppres- 
sion. This is an example of the entropy of a configuration — which, in this context is 


p[wall at x = X] = 


p|wall anywhere] = 


log(L/W) — outweighing the energetic cost. 
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Of course, once we’ve got one domain wall there’s nothing to stop us having another, 
flipping us back to m = +mpo. If we have an even number of domain walls along the 
line, we will be back at m = mo by the time we get to the right-hand edge at x = +L /2; 
an odd number and we'll sit in the other vacuum m = —mo. Which of these is most 
likely? 


The probability to have n walls, placed anywhere, is 


—nBF yall 1 L/2 L/2 L/2 1 L —BFwant Tu 

e e 

p|n walls - =! ax, f day... f dEn = ( ) 
| ZW" J_ry e ro Zn! W 


This means that the probability that we start at m = mg and end up at m = mo is 


1 1 L e7 PF wall n 1 L e PF wall 
plmo = mo] =F = T) = — cosh CT) 


W Z W 


n even 


Meanwhile, the probability that the spin flips is 


1 1 L e7 PF wall i 1 i L e7 PF wall 
pimo —> —mMo| = Z D al (=) = 7 sinh (=) 


n odd 


We see that, at finite L, there is some residual memory of the boundary condition that 
we imposed on the left-hand edge. However, as L — oo, this gets washed away. You’re 
just as likely to find the spins up as down on the right-hand edge. 


Although we phrased this calculation in terms of pinning a choice of ground state on 
a boundary, the same basic physics holds on an infinite line. Indeed, this is a general 
principle: whenever we have a Landau-Ginzburg theory characterised by a discrete 
symmetry — like the Zə of the Ising model — then the ordered phase will have a number 
of degenerate, disconnected ground states which spontaneously break the symmetry. 
In all such cases, the lower critical dimension is dı = 1 and in all cases the underlying 
reason is the same: fluctuations of domain walls will take us from one ground state to 
another and destroy the ordered phase. It is said that the domain walls proliferate. 


We could try to run the arguments above in dimensions d > 2. The first obvious 
place that it fails is that the free energy cost of the domain wall (1.33) now scales with 
the system size, L. This means that as L increases, we pay an exponentially large cost 
from e~*w!, Nonetheless, one could envisage a curved domain wall, which encloses 
some finite region of the other phase. It turns out that this is not sufficient to disorder 
the system. However, in d = 2, the fluctuations of the domain wall become important 
as one approaches the critical point. 
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1.3.4 Lev Landau: 1908-1968 


Lev Landau was one of the great physicists of the 20th century. He made important 
contributions to nearly all areas of physics, including the theory of magnetism, phase 
transitions, superfluids and superconductors, Fermi liquids and quantum field theory. 
He founded a school of Soviet physics whose impact lasts to this day. 


Landau was, by all accounts, boorish and did not suffer fools gladly. This did not sit 
well with the authorities and, in 1938, he was arrested in one of the great Soviet purges 
and sentenced to 10 years in prison. He was rescued by Kapitza who wrote a personal 
letter to Stalin arguing, correctly as it turned out, that Landau should be released as 
he was the only person who could understand superfluid helium. Thus, for his work 
on superfluids, Landau was awarded both his freedom and, later, the Nobel prize. His 
friend, the physicist Yuri Rumer, was not so lucky, spending 10 years in prison before 
exile to Siberia. 


Legend has it that Landau hated to 
write. The extraordinarily ambitious, multi- 
volume “Course on Theoretical Physics” 
was entirely written by his co-author Evgeny 
Lifshitz, usually dictated by Landau. As 
Landau himself put it: ” Evgeny is a great 
writer: he cannot write what he does not 
understand” . 


Here is a story about a visit by Niels 


Bohr to Landau’s Moscow Institute. Bohr 
was asked by a journalist how he succeeded Figure 15: 

in creating such a vibrant atmosphere in 

Copenhagen. He replied “I guess the thing is, I’ve never been embarrassed to admit to 
my students that I’m a fool’. This was translated by Lifshitz as: “I guess the thing is, 
I’ve never been embarrassed to admit to my students that they’re fools”. According 
to Kapista, the translation was no mistake: it simply reflected the difference between 
Landau’s school and Bohr’s. 


In 1962, Landau suffered a car accident. He survived but never did physics again. 
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2. My First Path Integral 


It’s now time to understand a little better how to deal with the path integral 
Z= f Dms) e PR lm) (2.1) 


Our strategy — at least for now — will be to work in situations where the saddle point 
dominates, with the integral giving small corrections to this result. In this regime, we 
can think of the integral as describing the thermal fluctuations of the order parameter 
m(x) around the equilibrium configuration determined by the saddle point. As we will 
see, this approach fails to work at the critical point, which is the regime we’re most 
interested in. We will then have to search for other techniques, which we will describe 
in Section 3. 


Preparing the Scene 


Before we get going, we’re going to change notation. First, we will change the name of 
our main character and write the magnetisation as 


m(x) > (x) 


If you want a reason for this, I could tell you that the change of name is in deference to 
universality and the fact that the field could describe many things, not just magnetisa- 
tion. But the real reason is simply that fields in path integrals should have names like 
@. (This is especially true in quantum field theory where m is reserved for the mass of 
the particle.) 


We start by setting B = 0; we'll turn B back on in Section 2.2. The free energy is 
then 


1 


Floto] = fate [BaT aT + AOV +... 


Roughly speaking, path integrals are trivial to do if F'|¢(x)] is quadratic in ¢, and 
possible to do if the higher order terms in F'|¢(x)] give small corrections. If the higher 
order terms in F'[¢(x)] are important, then the path integral is typically impossible 
without the use of numerics. Here we’ll start with the trivial, building up to the 
“possible” in later chapters. 
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To this end, throughout the rest of this chapter we will restrict attention to a free 
energy that contains no terms higher than quadratic in ¢(x). We have to be a little 
careful about whether we sit above or below the critical temperature. When T > Te, 
things are easy and we simply throw away all higher terms and work with 


F600) = 5 f de (1V6- Vo +120") (2.2) 
where u? = a2(T) is positive. 


A word of caution. We are ignoring the quartic terms purely on grounds of expedi- 
ency: this makes our life simple. However, these terms become increasingly important 
as u? ~ Q(T) — 0 and we approach the critical point. This means that nothing we are 
about to do can be trusted near the the critical point. Nonetheless, we will be able to 
extract some useful intuition from what follows. We will then be well armed to study 
critical phenomena in Section 3. 


When T < Ti, and a9(T’) < 0, we need to do a little more work. Now the saddle 
point does not lie at ¢ = 0, but rather at (¢) = +mpo given in (1.31). In particular, it’s 
crucial that we keep the quartic term because this rescues the field from the upturned 
potential it feels at the origin. 


However, it’s straightforward to take this into account. We simply compute the path 
integral about the appropriate minimum by writing 


(x) = (x) — (6) (2.3) 
Substituting this into the free energy gives 


FI) = Fling] +5 f ax [at (1)? + (TV)? +. (2.4) 


where now the ... include terms of order ¢? and ¢* and higher, all of which we’ve 
truncated. Importantly, there are no terms linear in ġ. In fact, this was guaranteed 
to happen: the vanishing of the linear terms is equivalent to the requirement that the 
equation of motion (1.30) is obeyed. The new quadratic coefficient is 


a (T) = (T) + 3mĉa,(T) = —202(T) (2.5) 


In particular, a4(T) > 0. 
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Practically, this means that we take the calculations that we do at T > Tẹ with the 
free energy (2.2) and trivially translate them into calculations at T < Te. We just 
need to remember that we’re working with a shifted @ field, and that we should take 
uw? = at(T) = |2a2(T)|. Once again, the same caveats hold: our calculations should 
not be trusted near the critical point u? = 0. 


2.1 The Thermodynamic Free Energy Revisited 


For our first application of the path integral, we will compute something straightforward 
and a little bit boring: the thermodynamic free energy. There will be a little insight to 
be had from the result of this, although the main purpose of going through these steps 
is to prepare us for what’s to come. 


We’ve already found some contributions to the thermodynamic free energy. There is 
the constant term Fo(7’) and, if we’re working at T < T., the additional contribution 
F'|mo] in (2.4). Here we are interested in further contributions to Fihermo, coming from 
fluctuations of the field. To keep the formulae simple, we will ignore these two earlier 
contributions; you can always put them back in if you please. 


Throughout this calculation, we’ll set B = 0 so we’re working with the free energy 
(2.2). There is a simple trick to compute the partition function when the free en- 
ergy is quadratic: we work in Fourier space. We write the Fourier transform of the 
magnetisation field as 


$k = f d'z e™™* (x) 


Since our original field ¢(x) is real, the Fourier modes obeys ¢{ = @_x. 


The k are wavevectors. Following the terminology of quantum mechanics, we will 
refer to k as the momentum. At this stage, we should remember something about our 
roots. Although we’re thinking of ¢(x) as a continuous field, ultimately it arose from 
a lattice and so can’t vary on very small distance scales. This means that the Fourier 
modes must all vanish for suitably high momentum, 


gy =0 for |k| >A 
Here A is called the ultra-violet (UV) cut-off. In the present case, we can think of 


A = T/a, with a the distance between the boxes that we used to coarse grain when 
first defining ¢(x). 
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We can recover our original field by the inverse Fourier transform. It’s useful to have 
two different scenarios in mind. In the first, we place the system in a finite spatial 
volume V = L”. In this case, the momenta take discrete values, 


2 
k= Sn ne Zt (2.6) 


and the inverse Fourier transform is 
1 ik-x 
(x) = y 2 e<* dy (2.7) 


Alternatively, if we send V — oo, the sum over k modes becomes an integral and we 
have 


In what follows, we'll jump backwards and forwards between these two forms. Ulti- 
mately, we will favour the integral form. But there are times when it will be simpler 
to think of the system in a finite volume as it will make us less queasy about some of 
the formulae we’ll encounter. 


For now, let’s work with the form (2.8). We substitute this into the free energy to 
find 


Fox] 


J d¢k, d*%kg 


= 9 (2Q7)4 (2Q7)4 jets (—7ki . kə + u’) Oki kə eilkit+ke)-x 


The integral over x is now straightforward and gives us a delta function 
jez ei(kitk2)-x = (2m) 454(ky + kə) 


and the free energy takes the simple form 


1 


Fol =3 f gy OPH adang y OEE e9) 


Now we can see the benefit of working in Fourier space: at quadratic order, the free 


energy decomposes into individual øk modes. This is because the Fourier basis is the 
eigenbasis of —yV? + p?, allowing us to diagonalise this operator. 
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To perform the functional integral, we also need to change the measure. Recall that 
the path integral was originally an integral over $(x) for each value of x labelling the 
position of a box. Now it is an integral over all Fourier modes, which we write as 


[ Peo = II [n f aoao] (2.10) 


I should warn you that this equation, as written, isn’t quite right because we should 
remember that $(x) is real, which means that ¢{ = ¢_,. So the measure above double 
counts the number of integrations we’re doing. We could try to amend our notation 
to take care of this, but it’s better to just remember what the measure above means. 
We won’t have to remember for long: the issue is resolved by the time we get to (2.11) 
below. 


I’ve included a normalisation constant M in the measure. Pll make no attempt to 
calculate this and, ultimately, it won’t play any role because, having computed the 
partition function, the first thing we do is take the log and differentiate. At this point, 
N will drop out. In later formulae, we’ll simply ignore it. But it’s worth keeping it in 
for now. 


Our path integral is now 


7 A B f dtk 
eT f inaa oof- f E oer) 


If this still looks daunting, we just need to recall that in finite volume, the integral in 


the exponent is really a sum over discrete momentum values, 


7 f O B 2 2 2 
Z = JIN | dona os ( TA (yk +?) 


k 
= I] [N f dodo exp (5 (yk? +p’) inl?) 
k 


Note that the placement of brackets shifted in the two lines, because the sum in the 
exponent got absorbed into the overall product. If this is confusing, it might be worth 
comparing what we’ve done above to a simple integral of the form f dx dy ery = 


(J dz e™)(f dy e=). 


We’re left with something very straightforward: it’s simply a bunch of decoupled 
Gaussian integrals, one for each value of k. Recall that Gaussian integral over a single 
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variable is given by 


+00 
f dr e ®/ = Ira (2.11) 


co 


Applying this for each k, we have our expression for the path integral 


2nTV 
Z= 
[I~ JEH 


where the square root is there, despite the fact that we’re integrating over complex @x, 
because of = @_, is not an independent variable. Note that we have a product over all 
k. In finite volume, where the possible k are discrete, there’s nothing fishy going on. 
But as we go to infinite volume, this will become a product over a continuous variable 
k. 


We can now compute the contribution of these thermal fluctuations to the ther- 
modynamic free energy. The free energy per unit volume is given by Z = e~@Fthermo 
or, 


Finermo _ Ty log Z = 7 be 2rT VN? 
V V 2V yk? + p? 


We can now revert back to the integral over k, rather than the sum by writing 


Pits | dik o [TVN 
(27)¢ qk? + p? 


V 2 
This final equation might make you uneasy since there an explicit factor of the volume 
V remains in the argument, but we’ve sent V — oo to convert from >>, to f d?k. At 
this point, the normalisation factor M will ride to the rescue. However, as advertised 
previously, none of these issues are particularly important since they drop out when we 
compute physical quantities. Let’s look at the simplest example. 


2.1.1 The Heat Capacity 


Our real interest lies in the heat capacity per unit volume, c = C/V. Specifically, we 
would like to understand the temperature dependence of the heat capacity. This is 
given by (1.15), 


82 L foo O a d'k TVN? 
=e = log | =~ 
C= 7 age SAN a TAF / (amy S (IRF p? 
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The derivatives hit both the factor of T in the numerator, and any T dependence in the 
coefficients y and y?. For simplicity, let’s work at T > Te. We’ll take y constant and 
u? =T—T,. A little bit of algebra shows that the contribution to the heat capacity 
from the fluctuations is given by 


e atk f E E 
CT 2 J Qn? yk? + pP (yk? + p?)? l 


The first of these terms has a straightforward interpretation: it is the usual “kp” per 
degree of freedom that we expect from equipartition, albeit with kg = 1. (This can be 
traced to the original 3 in e^.) 


The other two terms come from the temperature dependence in F'|¢(x)]. What 
happens next depends on the dimension d. Let’s look at the middle term, proportional 


A ka-1 
f "TE aa 
0 yk? + p? 


For d > 2, this integral diverges as we remove the UV cut-off A. In contrast, when 


to 


d = 1 it is finite as A + oo. When it is finite, we can easily determine the leading 
order temperature dependence of the integral by rescaling variables. We learn that 


i as At? whend>2 
0 yee +p 1/u  whend=1 


When d = 2, the term A° should be replaced by a logarithm. Similarly, the final term 
in (2.12) is proportional to 


[ ME ols At4 when d > 4 
0 (yk? + p?)? pt4* when d<4 
again, with a logarithm when d = 4. 


What should we take from this? When d > 4, the leading contribution to the heat 
capacity involves a temperature independent constant, A, albeit a large one. This 
constant will be the same on both sides of the transition. (The heat capacity itself is 
not quite temperature independent as it comes with the factor of T? from the numerator 
of (2.12), but this doesn’t do anything particularly dramatic.) In contrast, when d < 4, 
the leading order contribution to the heat capacity is proportional to p4~*. And, this 
leads to something more interesting. 
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To see this interesting behaviour, we have to do something naughty. Remember that 
our calculation above isn’t valid near the critical point, u? = 0, because we’ve ignored 
the quartic term in the free energy. Suppose, however, that we throw caution to the 
wind and apply our result here anyway. We learn that, for d < 4, the heat capacity 
diverges at the critical point. The leading order behaviour is 


d 
cw |T-—T,|-* with a= 2-5 (2.14) 


This is to be contrasted with our mean field result which gives a = 0. 


As we’ve stressed, we can’t trust the result (2.14). And, indeed, this is not the right 
answer for the critical exponent. But it does give us some sense for how the mean field 
results can be changed by the path integral. It also gives a hint for why the critical 
exponents are not affected when d > 4, which is the upper critical dimension. 


2.2 Correlation Functions 


The essential ingredient of Landau-Ginzburg theory — one that was lacking in the earlier 
Landau approach — is the existence of spatial structure. With the local order parameter 
(x), we can start to answer questions about how the magnetisation varies from point 
to point. 


Such spatial variations exist even in the ground state of the system. Mean field 
theory — which is synonymous with the saddle point of the path integral — tells us that 
the expectation value of the magnetisation is constant in the ground state 


0 aes 


(0(x)) = (2.15) 


+m TLT; 

This makes us think of the ground state as a calm fluid, like the Cambridge mill pond 
when the tourists are out of town. This is misleading. The ground state is not a single 
field configuration but, as always in statistical mechanics, a sum over many possible 
configurations in the thermal ensemble. This is what the path integral does for us. 
The importance of these other configurations will determine whether the ground state 
is likely to contain only gentle ripples around the background (2.15), or fluctuations so 
wild that it makes little sense to talk about an underlying background at all. 


These kind of spatial fluctuations of the ground state are captured by correlation 
functions. The simplest is the two-point function (¢(x)é(y)), computed using the prob- 
ability distribution (1.26). This tells us how the magnetisation at point x is correlated 
with the magnetisation at y. If, for example, there is an unusually large fluctuation at 
y, what will the magnitude of the field most likely be at x? 
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Because (¢(x)) takes different values above and below the transition, it is often more 
useful to compute the connected correlation function, 


(4(x) O(¥))c = (G(x) oly) — (4)? (2.16) 
If you’re statistically inclined, this is sometimes called a cumulant of the random vari- 


able (x). 


The path integral provides a particularly nice way to compute connected correlation 
functions of this kind. We consider the system in the presence of a magnetic field B, 
but now allow B(x) to also vary in space. We take the free energy to be 


Fle) = fate [Z0 + Forex) — Bola) (2.17) 


We can now think of the partition function as a functional of B(x). 


x)] = f Do e PF 


For what it’s worth, Z[B(x)] is related to the Legendre transform of F'|@(x)]. 


Now that Z depends on the function B(x) it is a much richer and more complicated 
object. Indeed, it encodes all the information about the fluctuations of the theory. 
Consider, for example, the functional derivative of log Z, 


ldlogZ | 
B ôB(x) -7 e 


-3J Do p(x) eF = ((x))z 


Here I’ve put a subscript B on (-}g to remind us that this is the expectation value 
computed in the presence of the magnetic field B(x). If our real interest is in what 
happens as we approach the critical point, we can simply set B = 0. 


Similarly, if we can take two derivatives of log Z. Now when the second derivative 
hits, it can either act on the exponent e~°”, or on the 1/Z factor in front. The upshot 
is that we get 


1 &lgZ — 1 Z = 1 64 êZ 
82 5B(x)dBly) BZ 5B(x)dB(y) ßB2?Z2őB(x)őB(y) 
1 &logZ 
FERRI) = (0y) (ly) 
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which is precisely the connected correlation function (2.16). In what follows, we’ll 
mostly work above the critical temperature so that (¢)s-0 = 0. In this case, we set 
B= 0 to find 


i ô? log Z 
5? ôB(x)ðB(y)| r-o 


All that’s left is for us to compute the path integral Z[|B(x)]. 


= ($(x)ġ(y)) (2.18) 


2.2.1 The Gaussian Path Integral 


As in our calculation of the thermodynamic free energy, we work in Fourier space. The 
free energy is now a generalisation of (2.9), 


dk 
Fig] = I ean É (yk? + u’) oxo — B_xox 


where By, are the Fourier modes of B(x). To proceed, we complete the square, and 
define the shifted magnetisation 


bx = k — 


We can then write the free energy as 


i de [1,5 aana 1 e 
Fil = | oop 5 (ok + u’) |x| ETE 


Our path integral is 
Z= [I f daði e- PF lou 
k 


where we’ve shifted the integration variable from ¢, to g; there is no Jacobian penalty 
for doing this. We’ve also dropped the normalisation constant M that we included in 
our previous measure (2.10) on the grounds that it clutters equations and does nothing 
useful. 


The path integral now gives 


Z[B(x)] = ePFinemo exp (EJ a |Bx|? 


2 2T)! yk? + p? 


The first term e~°*treme js just the contribution we saw before. It does not depend on 
the magnetic field B(x) and won’t contribute to the correlation function. (Specifically, 
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it will drop out when we differentiate log Z.) The interesting piece is the dependence 
on the Fourier modes By. To get back to real space B(x), we simply need to do an 
inverse Fourier transform. We have 


Z[B(x)| = e PFtnermo exp ( | tads B(x)G(x — y)B(y)) (2.19) 


where 


d¢k e7tk x 
G(x) = / one ie (2.20) 


We're getting there. Differentiating the partition function as in (2.18), we learn that 
the connected two-point function is 


U) = 5G -y) (2.21) 
We just need to do the integral (2.20). 


Computing the Fourier Integral 


To start, note that the integral G(x) is rotationally invariant, and so G(x) = G(r) with 
r = |x|. We write the integral as 


1 fdk = 
O= | ORETTE 


where we’ve introduced a length scale 


ea (2.22) 
This is called the correlation length and it will prove to be important as we move 


forwards. We’ll discuss it more in Section 2.2.3. 


To proceed, we use a trick. We can write 


1 [ 2 2 
a dt etH) 
k? +1/6 Jo 


Using this, we have 


1 dtk = —ik-x—t(k?+1/€? 
on) = fo | dt e eta 
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where, in going to the last line, we’ve simply done the d Gaussian integrals over k. 
At this point there are a number of different routes. We could invoke some special- 
functionology and note that we can massage the integral into the form of a Bessel 
function 


aoe =f dt bet 2 
2 Jo 

whose properties you can find in some dog-eared mathematical methods textbook. 

However, our interest is only in the behaviour of G(r) in various limits, and for this 

purpose it will suffice to perform the integral (2.23) using a saddle point. We ignore 

overall constants, and write the integral as 


r2 


ig t d 
~ dteS® with S(t)=—+—+-logt 
G(r) / e with S(t) mg + 5 log 


The saddle point t = t, sits at S’(t,) =0. We then approximate the integral as 


CA dt e~ (te) +9" (te )P/2 Slt) 
(r) / e ZS") e 


For us the saddle lies at 


There are two different limits that we are interested in: r >> € and r < €. We’ll deal 
with them in turn: 


r > €: In this regime, we have t, ~ r€/2. And so S(t,) ~ r/& + (d/2) log(r€/2). 
One can also check that S”(t,) = 2/r€?. The upshot is that the asymptotic form of the 
integral scales as 


1 eWr/é 
= €4/2-3/2 pd/2-1/2 


G(r) r >t 


At large distance scales, the correlation function falls off exponentially. 


r<&€: In the other regime, the saddle point lies at t, ~ r?/2d, giving S(t,) ~ 
d+ (d/2)log(r?/2d) and S”(t,) ~ 2d?/r*. Putting this together, we see that for r < €, 
the fall-off is only power law at short distances, 
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We learn that the correlation function changes its form at the distances scale r ~ €, 
with the limiting form 


i 
E ree 

(9(x)O(y)) ~ E (2.24) 
SEE ro>>€& 


This is known as the Ornstein-Zernicke correlation. 


2.2.2 The Correlation Function is a Green’s Function 


The result (2.24) is important and we we'll delve a little deeper into it shortly. But 
first, it will prove useful to redo the calculation above in real space, rather than Fourier 
space, to highlight some of the machinery hiding behind our path integral. 


To set some foundations, we start with a multi-dimensional integral over n variables. 
Suppose that y is an n-dimensional vector. The simple Gaussian integral now involves 
an invertible n x n matrix G, 

+00 
i d'y e737 = det? (21G) 
This result follows straighforwardly from the single-variable Gaussian integral (2.11), 
by using a basis that diagonalises Œ. Similarly, if we introduce an n-dimensional vector 
B, we can complete the square to find 
+00 

f d'y e7390 'YtBY = dett? (21G) e3P 9B (2.25) 
Now let’s jump to the infinite dimensional, path integral version of this. Throughout 
this section, we’ve been working with a quadratic free energy 


Flee) = fate [Fave + 51264) + Bodo (2.26) 
We can massage this into the form of the exponent in (2.25) by writing 
Flees) = f de | d'y 5oe)G x yoy) + | ate Ba) 


where we’ve introduced the “infinite dimensional matrix”, more commonly known as 
an operator 


G(x, y) = 6%(x— y) (7V3 +H’) (2.27) 


Note that this is the operator that appears in the saddle point evaluation of the free 
energy, as we saw earlier in (1.30). 
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Given the operator G~', what is the inverse operator G(x,y)? We have another 
name for the inverse of an operator: it is called a Green’s functions. In the present 
case, G(x, y) obeys the equation 


(—yV2+ W)G(x,y) = 6%(x-y) 


By translational symmetry, we have G(x, y) = G(x — y). You can simply check that 
the Green’s function is indeed given in Fourier space by our previous result (2.20) 


d¢k ek x 
oa) | (2m)4 yk? + p? 


This route led us to the same result we had previously. Except we learn something 
new: the correlation function is the same thing as the Green’s function, (@(x)¢(y)) = 
6B-*G(x,y), and hence solves, 


(7? + 12) (o(x)4(0)) = 70) 


This is telling us that if we perturb the system at the origin then, for a free energy of 
the quadratic form (2.26), the correlator (¢(x)¢(0)) responds by solving the original 
saddle point equation. 


There is one further avatar of the correlation function that is worth mentioning: it 
is related to the susceptibility. Recall that previously we defined the susceptibility in 
(1.19) as y = Om/OB. Now, we have a more refined version of susceptibility which 
knows about the spatial structure, 


6(¢(x)) 
ôB(y) 


But, from our discussion above, this is exactly the correlation function x(x, y) = 


x(x, y) = 


B(o(x)@(y)). We can recover our original, coarse grained susceptibility as 


ve f dêr x(x,0) = £ f dx (6(x)9(0)) (2.28) 


The two point correlation function will play an increasingly important role in later 
calculations. For this reason it is given its own name: it is called the propagator. 
Propagators of this kind also arose in the lectures on Quantum Field Theory. In that 
case, the propagator was defined for a theory in Minkowski space, which led to an 
ambiguity (of integration contour) and a choice of different propagators: advanced, 
retarded or Feynman. In the context of Statistical Field Theory, we are working in 
Euclidean space and there is no such ambiguity. 
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2.2.3 The Correlation Length 


Let’s now look a little more closely at the expression (2.24) for the correlation function 
which, in two different regimes, scales as 


1 
Ta PEE 
Aly) ~ (2.29) 
em™is 
Wen "> 
where r = |x — y|. The exponent contains a length scale, €, called the correlation 


length, defined in terms of the parameters in the free energy as €? = y/p?. 


We see from (2.29) that all correlations die off quickly at distances r >> €. In contrast, 
for r < € there is only a much slower, power-law fall-off. In this sense, € provides a 
characteristic length scale for the fluctuations. In a given thermal ensemble, there will 
be patches where the magnetisation is slightly higher, or slightly lower than the average 
(m). The size of these patches will be no larger than €. 


Recall that, close to the critical point, u? ~ |T —T,|. This means that as we approach 
T = T,, the correlation length diverges as 


1 


This is telling us that system will undergo fluctuations of arbitrarily large size. This is 
the essence of a second order phase transition, and as we move forward we will try to 
better understand these fluctuations. 


Numerical Simulations of the Ising Model 


It’s useful to get a sense for what these fluctuations look like. We start in the disordered 
phase with T > T. In the figures you can see two typical configurations that contribute 
to the partition function of the Ising modelë. The up spins are shown in yellow, the 
down spins in blue. 


On the left, the temperature is Ti > T., while on the right the temperature is 
To > Ti > Te In both pictures, the spins look random. And yet, you can see by 
eye that there is something different between the two pictures; on the right, when the 


° These images were generated by the Metropolis algorithm using a mathematica programme created 
by Daniel Schroeder. It’s well worth playing with to get a feel for what’s going on. Ising simulations, 
in various formats, can be found on his webpage http://physics.weber.edu/thermal/computer.html. 
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Figure 16: Spins with when T > Te Figure 17: Spins when T > Te 


temperature is higher, the spins are more finely intertwined, with a yellow spin likely 
to have a blue dot sitting right next to it. Meanwhile, on the left, the randomness is 
coarser. 


What you’re seeing here is the correlation length at work. In each picture, € sets the 
typical length scale of fluctuations. In the right-hand picture, where the temperature 
is higher, the correlation length is smaller. 


There is a similar story in the ordered phase, with T < T,. Once again, we show 
two configurations below. Now the system must choose between one of the two ground 
states; here the choice is that the yellow, up spins are dominant. The left-hand con- 
figuration has temperature T] < Te, and the right-hand configuration temperature 
T; < Ti < Te. We see that sizes of the fluctuations around the ordered phase become 
smaller the further we sit from the critical point. 
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Figure 18: Spins with when T < Te Figure 19: Spins when T < Te 
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Figure 20: T = T, 


Finally, we can ask what happens when we sit at the critical point T = Tẹ. A typical 
configuration is shown in Figure 20. Although it may not be obvious, there is now 
no characteristic length scale in the picture. Instead, fluctuations occur on all length 
scales, big and small®. This is the meaning of the diverging correlation length € — oo. 


Critical Opalescence 


There is a nice experimental realisation of these large fluctuations, which can be seen 
in liquid-gas transitions or mixing transitions between two different fluids. (Both of 
these lie in the same universality class as the Ising model.) As we approach the second 
order phase transition, transparent fluids become cloudy, an effect known as critical 
opalescence’. What’s happening is that the size of the density fluctuations is becoming 
larger and larger, until they begin to scatter visible light. 


More Critical Exponents 


We saw in previous sections that we get a number of power-laws at critical points, 
each of which is related to a critical exponent. The results above give us two further 
exponents to add to our list. First, we have a correlation length € which diverges at 
the critical point with power (2.30) 


1 


g~ Pen 


1 
h = = 
where v= 5 


©The best demonstration that I’ve seen of this scale invariance at the critical point is this Youtube 
video by Douglas Ashton. 
7You can see a number of videos showing critical opalescence on Youtube. For example, here. 
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Similarly, we know that the correlation function itself is a power law at the critical 
point, with exponent 


(6)0(y)) ~ sae where n =0 


Each of these can be thought of as a mean field prediction, in the sense that we are 
treating the path integral only in quadratic order, which neglects important effects near 
the critical point. Given our previous discussion, it may not come as a surprise to learn 
that these critical exponents are correct when d > 4. However, they are not correct in 
lower dimensions. Instead one finds 


| MF d=2 d= 
n| 0 1 0.0363 
y| 2 1 0.6300 


This gives us another challenge, one we will rise to in Section 3. 


2.2.4 The Upper Critical Dimension 


We’re finally in a position to understand why the mean field results hold in high di- 
mensions, but are incorrect in low dimensions. Recall our story so far: when T < Te, 
the saddle point suggests that 


(O(x)) = Emo 


Meanwhile, there are fluctuations around this mean field value described, at long dis- 
tances, by the correlation function (2.29). In order to trust our calculations, these 
fluctuations should be smaller than the background around which they’re fluctuating. 
In other words, we require (¢”) < (@)?. 


It’s straightforward to get an estimate for this. We know that the fluctuations decay 
after a distance r >> €. We can gain a measure of their importance if we integrate over 
a ball of radius €. We’re then interested in the ratio 


fa i dix (o(x)(0)) 7 1 T m rd—1 gi 


— e 
2ed d—2 2 
L diz må mE T mo 


In order to trust mean field theory, we require that this ratio is much less than one. 
This is the Ginzburg criterion. We can anticipate trouble as we approach a critical 
point, for here € diverges and mp vanishes. According to mean field theory, these two 
quantities scale as 


my ~|T-T|? and ~ |T -T| 
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results which can be found, respectively, in (1.31) and (2.30). This means that the 
ratio R scales as 


Rw IT — T, 679/2 


We learn that, as we approach the critical point, mean field — which, in this context, 
means computing small fluctuations around the saddle point — appears trustworthy 
only if 


d> de =4 


This is the upper critical dimension for the Ising model. Actually, at the critical di- 
mension d = 4 there is a logarithmic divergence in R and so we have to treat this case 
separately; we’ll be more careful about this in the Section 3. 


For dimensions d < 4, mean field theory predicts its own demise. We’ll see how to 
make progress in Section 3. 


2.3 The Analogy with Quantum Field Theory 


There is a very close analogy between the kinds of field theories we’re looking at here, 
and those that arise in quantum field theory. This analogy is seen most clearly in 
Feynman’s path integral approach to quantum mechanics’. Correlation functions in 


both statistical and quantum field theories are captured by partition functions 
Statistical Field Theory: Z= |v e BS ax Fld) 
Quantum Field Theory: Z = |v ers x L(G) 


You don’t need to be a visionary to see the similarities. But there are also some 
differences: the statistical path integral describes a system in d spatial dimensions, 
while the quantum path integral describes a system in d spacetime dimensions, or d— 1 
spatial dimensions. 


The factor of 7 in the exponent of the quantum path integral can be traced to the 
fact that it describes a system evolving in time, and means that it has more subtle 
convergence properties than its statistical counterpart. In practice, to compute any- 
thing in the quantum partition function, one tends to rotate the integration contour 
and work with Euclidean time, 


r= it (2.31) 


8You will see this in next term’s Advanced Quantum Field Theory course. 
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This is known as a Wick rotation. After this, the quantum and statistical partition 
functions are mathematically the same kind of objects 


Z= fo ets s Z= fos e- Sxl¢l/h 


where Spl|ọ] is the Euclidean action, and is analogous to the free energy in statisti- 
cal mechanics. If the original action S[ọ]| was Lorentz invariant, then the Euclidean 
action Sg|ọ] will be rotationally invariant. Suddenly, the d = 4 dimensional field the- 
ories, which seemed so unrealistic in the statistical mechanics context, take on a new 
significance. 


By this stage, the only difference between the two theories is the words we drape 
around them. In statistical mechanics, the path integral captures the thermal fluc- 
tuations of the local order parameter, with a strength controlled by the temperature 
28; in quantum field theory the path integral captures the quantum fluctuations of the 
field ¢, with a strength controlled by A. This means that many of the techniques we 
will develop in this course can be translated directly to quantum field theory and high 
energy physics. Moreover, as we will see in the next section, much of the terminology 
has its roots in the applications to quantum field theory. 


Note that the second order phase transition occurs in our theory when the coefficient 
of the quadratic term, vanishes: u? = 0. From the perspective of quantum field theory, 
a second order phase transition describes massless particles. 


Given that the similarities are so striking, one could ask if there are any differences 
between statistical and quantum field theories. The answer is yes: there are some 
quantum field theories which, upon Wick rotation, do not have real Euclidean actions. 
Perhaps the simplest example is Maxwell (or Yang-Mills) theory, with the addition of 
a “theta term”, proportional to e”? FF. This gives rise to subtle effects in the 
quantum theory. However, because it contains a single time derivative, it becomes 
imaginary in the 7 variable (2.31) and, correspondingly, there is no interpretation of 
e 5zl@l as probabilities in a thermal ensemble. 


A Different Perspective on the Lower Critical Dimension 


A statistical field theory in d = 1 spatial dimensions is related to quantum field 
theory in d = 0 spatial dimensions. But we have a name for this: we call it quantum 
mechanics. 
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Viewed in this way, the lower critical dimension be- w 
comes something very familiar. Consider the quartic po- 
tential V(x) shown in the figure. Classical considerations 
suggest that there are two ground states, one for each of the 


minima. But we know that this is not the way things work 7 | eS 


in quantum mechanics. Instead, there is a unique ground 


state in which the wavefunction has support in both min- Figure 21: 

ima, but with the expectation value (x) = 0. Indeed, the 

domain wall calculation that we described in Section 1.3.3 is the same calculation that 
one uses to describe quantum tunnelling using the path integral. 


Dressed in fancy language, we could say that quantum tunnelling means that the Zə 
symmetry cannot be spontaneously broken in quantum mechanics. This translates to 
the statement that there are no phase transitions in d = 1 statistical field theories. 
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3. The Renormalisation Group 


We've built up the technology of field theory and path integrals, and I’ve promised you 
that this is sufficient to understand what happens at a second order phase transition. 
But so far, we’ve made little headway. All we’ve seen is that as we approach the critical 
point, fluctuations dominate and the Gaussian path integral is no longer a good starting 
point. We need to take the interactions into account. 


Sometimes in physics, you can understand a phenomenon just by jumping in and 
doing the right calculation. And we will shortly do this, using perturbation theory 
to understand how the ¢* terms change the critical exponents. However, to really 
understand second order phase transitions requires something more: we will need to set 
up the right framework in which to think of physics at various length scales. This set of 
ideas was developed in the 1960s and 1970s, by people like Leo Kadanoff, Michael Fisher 
and, most importantly, Kenneth Wilson. It goes by the name of the Renormalisation 
Group. 


3.1 What’s the Big Idea? 


Let’s start by painting the big picture. As in the previous section, we’re going to 
consider a class of theories based around a single scalar field (x) in d dimensions. (We 
will consider more general set-ups in Section 4.) The free energy takes the now familiar 
form, 


Fld] = fats [3v6 ve ! T pig +... (3.1) 


In what follows, we will look at what happens as we approach the critical point from 
above T > T,. All the important temperature dependence in (3.1) is sitting in the 
quadratic term, with 


pewT —T, (3.2) 


In contrast to the previous section, we will allow u? to take either sign: u? > 0 in the 
disordered phase, and u? < 0 in the ordered phase where (¢) Æ 0. 


There is one important change in convention from our earlier discussion: we have 


rescaled the coefficient of the gradient term to be 1/2; we will see the relevance of this 
shortly. All other terms have arbitrary coefficients. 
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So far we’ve focussed on just a few couplings, as shown in the free energy (3.1). 
Here we’re going to expand our horizons. We’ll consider all possible terms in the free 
energy, subject to a couple of restrictions. We’ll insist that the free energy is analytic 
around ¢ = 0, so has a nice Taylor expansion, and we will insist on the Zə symmetry 
ġġ — —¢, so that only even powers of ¢ arise. This means, for example, that we will 
include the term ¢° and (V7¢)? and 6" and ¢!°7(V@¢- V¢)V7¢ and so so. Each of these 
terms comes with its own coupling constant. However, we don’t include terms like $1" 
because this violates the Zə symmetry, nor 1/¢? because this is not analytic at ¢ = 0. 


Next, consider the infinite dimensional space, parameterised by the infinite number 
of coupling constants. We will call this the theory space (although I should warn you 
that this isn’t standard terminology). You should have in your mind something like 
this: 


But possibly bigger. 


As we’ve seen, our interest is in computing the partition function 
Z= fo e Fel (3.3) 


Note that I’ve written the exponent as e~* rather than e76”. This is because the overall 
power of 6 does nothing to affect the physics; all the relevant temperature dependence 
is in the coefficient (3.2) while, for the quantities of interest near the critical point, this 
overall factor can be set to 6 ~ 1/T,. You can think that we’ve simply rescaled this 
into the field ¢. 


There is one more ingredient that we need to make sense of the path integral (3.3). 
This is the UV cut-off A. Recall that, implicit in our construction of the theory is the 
requirement that the Fourier modes ¢, vanish for suitably high momenta 


Oye =0 fork > A 
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This arises because, ultimately our spins sit on some underlying lattice which, in turn, 
was coarse-grained into boxes of size a. The UV cut-off is given by A ~ 1/a. 


Until now, the UV cut-off has taken something of a back seat in our story, although 
it was needed to render some of the path integral calculation in the previous section 
finite. Now it’s time for A to move centre stage. As we will explain, we can use the 
cut-off to define a flow in the space of theories. 


Suppose that we only care about physics on long distance scales, L. Then we have 
no real interest in the Fourier modes ¢, with k >> 1/L. This suggests that we can write 
down a different theory, that has a lower cut-off, 


ae 
¢ 


for some Ç. As long as A’ >> 1/LZ, the scale of interest, our theory can tell us everything 
that we need to know. Moreover, we know, at least in principle, how to construct such 
a theory. We write our Fourier modes as 


Pk = Pk +E 
where ġ} describe the long-wavelength fluctuations 


x de k<N 
k = P 
0 k>A 


and ¢j describe the short-wavelength fluctuations that we don’t care about 


> oa N<k<A 
Oy. = 


0 otherwise 


There are several other names for these variables that are used interchangeably. The 
modes ¢, and ¢; are also referred to as low- and high-energy modes or, importing 
language from quantum mechanics, slow and fast modes, respectively. In a rather 
quaint nod to the electromagnetic spectrum, the short-distance, microscopic physics 
that we care little about is often called the ultra-violet; the long-distance physics that 
we would like to understand is the infra-red. 


Similarly, we decompose the free energy, written in Fourier space, as 


F lox) = Folk] + Foleg] + Frlox, ok] 


= 55 = 


Here F;[¢,,,¢] involves the terms which mix the short and long-wavelength modes. 
The partition function (3.3) can then be written as 


Z = f J] 2% aTa f [I dog eri f I] dot e` Polok] oF rltx $x] 


k<A k<’ MN<k<A 


We write this as 
Z= f De eo Fl") 


where F'[ọ] is known as the Wilsonian effective free energy. (In fairness, this term 
is rarely used: you’re more likely to hear “Wilsonian effective action” to describe the 
analogous object in a path integral describing a quantum field theory.) We’re left with 
a free energy which describes the long-wavelength modes, but takes into account the 
effects of the short wavelength modes. It is defined by 


o-P'lO] — e-Folek] f JI dot Ee oi (3.4) 
A’ <k<A 


In subsequent sections, we’ll put some effort into calculating this object. However, at 
the end of the day the new free energy F’[¢~] must take the same functional form as 
the original free energy (3.1), simply because we started from the most general form 
possible. The only effect of integrating out the high-momentum modes is to shift the 
infinite number of coupling constants, so we now have 


F'(¢] = f d?x 379 -Vo + sug? +g +... (3.5) 


We would like to compare the new free energy (3.5) with the original (3.1). However, 
we're not quite there yet because, the two theories are different types of objects — like 
apples and oranges — and shouldn’t be directly compared. This is because the theory 
is defined by both the free energy and the UV cut-off and, by construction our two 
theories have different cut-offs. This means that the original theory F'|¢] can describe 


things that the new theory F’[¢~| cannot, namely momentum modes above the cut-off 
N. 


It is straightforward to remedy this. We can place the two theories on a level playing 
field by rescaling the momenta in the new theory. We define 


k’ = ¢k 
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Now k’ takes values up to A, as did k in the original theory. The counterpart of this 
scaling in real space is 


This means that all lengths scales are getting smaller. You can think of this step as 
zooming out, to observe the system on larger and larger length scales. As you do so, 
all features become smaller. 


There is one final step that we should take. The new theory F’[¢] will typically have 
some coefficient y” ~ 1 in front of the leading, quadratic gradient term. To compare 
with the original free energy (3.1), we should rescale our field. We define 


k = VY be 
Which, in position space, reads 
(x!) = VY p(x) (3.6) 


Now, finally, our free energy takes the form 
/ 1 / / 1 
Riol = f ate [ZVE Vol POO (3.7) 


We see that this procedure induces a continuous map from Ç € [1, 00) onto the space 
of coupling constants. Our original coupling constants in (3.1) are those evaluated at 
¢ = 1. As we increase Ç, we trace out curves in our theory space, that look something 
like the picture shown in Figure 22 


We say that the coupling constants flow, where the direction of the flow is telling us 
what the couplings look like on longer and longer length scales. The equations which 
describe these flows — which we will derive shortly — are known, for historic reasons, as 
beta functions. 


These, then are the three steps of what is known as the renormalisation group (RG): 
e Integrate out high momentum modes, A/¢ < k < A. 
e Rescale the momenta k’ = ¢k. 


e Rescale the fields so that the gradient term remains canonically normalised. 


TEE 


{~~ 


Figure 22: Flows in theory space; the arrows are in the direction of increasing Ç. 


You may wonder why we didn’t just include a coupling constant 7(¢) for the gradient 
term, and watch that change too. The reason is that we can always scale this away by 
redefining ¢. But ¢ is just a dummy variable which is integrated over the path integral, 
so this rescaling can’t change the physics. To remove this ambiguity, we should pin 
down the value of one of the coupling constants, and the gradient term (V¢@)? is the 
most convenient choice. If we ever find ourselves in a situation where y(¢) = 0 for 
some ¢ then we would have to re-evaluate this choice. (We’ll actually come across an 
example where it’s sensible to make a different choice in Section 4.3.) 


The “renormalisation group” is not a great name. It has a hint of a group structure, 
because a scaling by ¢, followed by a scaling by ¢ gives the same result as a scaling by 
¢1¢2. However, unlike for groups, there is no inverse: we can only integrate out fields, 
we can’t put them back in. A more accurate name would be the “renormalisation 
semi-group” . 


The Renormalisation Group in Real Space 


The procedure we’ve described above is the renormalisation group in momentum space: 
to get an increasingly coarse-grained description of the physics, we integrate out succes- 
sive momentum shells. This version of the renormalisation group is most useful when 
dealing with continuous fields and will be the approach we will focus on in this course. 


There is a somewhat different, although ultimately equivalent, phrasing of the renor- 
malisation group which works directly in real space. This approach works best when 
dealing directly with lattice systems, like the Ising model. As we explained rather 
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briefly in Section 1.3, one constructs a magnetisation field m(x) by coarse-graining 
over boxes of size a, each of which contains many lattice sites. One can ask how the 
free energy changes as we increase a, a procedure known as blocking. Ultimately this 
leads to the same picture that we built above. 


3.1.1 Universality Explained 


Even before we do any calculations, there are general lessons to be extracted from the 
framework above. Let’s suppose we start from some point in theory space. This can be 
arbitrarily complicated, reflecting the fact that it contains informations about all the 
microscopic, short-distance degrees of freedom. 


Of course, we care little about most of these details so, in an attempt to simplify 
our lives we perform a renormalisation group transformation, integrating out short dis- 
tance degrees of freedom to generate a new theory which describes the long wavelength 
physics. And then we do this again. And then we do this again. Where do we end up? 


There are essentially two possibilities: we could flow off to infinity in theory space, 
or we could converge towards a fixed point. These are points which are invariant under 
a renormalisation group transformation. (One could also envisage further possibilities, 
such as converging towards a limit cycle. It turns out that these can be ruled out in 
many theories of interest.) 


Our interest here lies in the fixed points. The second step in the renormalisation 
group procedure ensures that fixed points describe theories that have no characteristic 
scale. If the original theory had a correlation length scale €, then the renormalised 
theory has a length scale €’ = €/¢. (We will derive this statement explicitly below 
when we stop talking and start calculating.) Fixed points must therefore have either 
E=0 ore = oo, 


In the disordered phase, with T > Tą, enacting an RG flow reduces the correlation 
length. Pictorially, we have 


In this case, shrinking the correlation length is equivalent to increasing the temperature. 
The end point of the RG flow, at € = 0, is the infinite temperature limit of the theory. 
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This is rather like flowing off to infinity in theory space. As we will see, it is not 
uncommon to end up here after an RG flow. But it is boring. 


Similarly, in the ordered phase the RG flow again reduces the correlation length, 


RG flow 
os 


Now the end point at € = 0 corresponds to the zero temperature limit; again, it is a 
typical end point of RG flow but is dull. 


Theories with € = oo are more interesting. As we saw above, this situation occurs 
at a critical point where the theory contains fluctuations on all length scales. Now, if 
we do an RG flow, the theory remains invariant. In terms of our visual configurations, 


RG flow 
> 


100 200 200 


Note that the configuration itself doesn’t stay the same. (It is, after all, merely a 
representative configuration in the ensemble.) However, as the fluctuations on small 
distance scales shrink away due to RG, they are replaced by fluctuations coming in 
from larger distance scales. The result is a theory which is scale invariant. For this 
reason, the term “critical point” is often used as a synonym for “fixed point” of the 
RG flow. 


This picture is all we need to understand the remarkable phenomenon of universality: 
it arises because many points in theory space flow to the same fixed point. Thus, many 
different microscopic theories have the same long distance behaviour. 

Relevant, Irrelevant or Marginal 


It is useful to characterise the properties of fixed points by thinking about the theories 
in their immediate neighbourhood. Obviously, there are an infinite number of ways we 
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Figure 23: The critical surface. Irrelevant deformations from the critical point are shown 
as dotted blue lines; the relevant deformation is is red. 


can move away from the fixed point. If we move in some of these directions, the RG 
flow will take us back towards the fixed point. These deformations are called irrelevant 
because if we add any such terms to the free energy we will end up describing the same 
long-distance physics. 


In contrast, there will be some directions in which the RG flow will sweep us away 
from the fixed point. These deformations are called relevant because if we add any such 
terms to the free energy, the long-distance physics will be something rather different. 
Examples of relevant and irrelevant deformations are shown in Figure 23. Much of the 
power of universality comes from the realisation that the vast majority of directions 
are irrelevant. For a given fixed point, there are typically only a handful of relevant 
deformations, and an infinite number of irrelevant ones. This means that our fixed 
points have a large basin of attraction, huge slices of the infinite dimensional theory 
space all converging to the same fixed point. The basin of attraction for a particular 
fixed point is called the critical surface. 


Finally, it’s possible that our fixed point is not a point at all, but a line or a higher 
dimensional surface living within theory space. In this case, if we deform the theory 
in the direction of the line, we will not flow anywhere, but simply end up on another 
fixed point. Such deformations are called marginal; they are rare, but not unheard of. 
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Why High Energy Physics is Hard 


Universality is a wonderful thing if you want to understand the low-energy, long- 
wavelength physics. It tells you that you can throw away many of the microscopic 
details because they are irrelevant for the things that you care about. 


In contrast, if you want to understand the high-energy, short distance physics then 
universality is the devil. It tells you that you have very little hope of extracting any 
information about microscopic degrees of freedom if you only have access to information 
at long distances. This is because many different microscopic theories will all give the 
same answer. 


As we saw in Section 2.3, quantum field theory is governed by the same mathematical 
structure as statistical field theory, and the comments above also apply. Suppose, for 
example, that you find yourself living in a technologically adolescent civilisation that 
can perform experiments at distance scales of 10716 cm or so, but no smaller. Yet, what 
you really care about is physics at, say, 1073? cm where you suspect that something 
interesting is going on. The renormalisation group says that you shouldn’t pin your 
hopes on learning anything from experiment. 


The renormalisation group isn’t alone in hiding high-energy physics from us. In 
gravity, cosmic censorship ensures that any high curvature regions are hidden behind 
horizons of black holes while, in the early universe, inflation washes away any trace 
of what took place before. Anyone would think there’s some kind of conspiracy going 
on.... 


3.2 Scaling 


The idea that second order phase transitions coincide with fixed points of the renor- 
malisation group is a powerful one. In particular, it provides an organising principle 
behind the flurry of critical exponents that we met in Section 1. 


As we explained above, at a fixed point of the renormalisation group any scale must 
be washed away. This is already enough to ensure that correlation functions must take 
the form of a power-law, 


1 
(9(x)$(0)) ~ “rae (3.8) 
Any other function would require a scale on dimensional grounds. The only freedom 
that we have is in the choice of exponent which we have chosen to parameterise as n. 
One of the tasks of the RG procedure is to compute 7, and we will see how this works 
in Section 3.5. 
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However, even here there’s something of a mystery because usually we can figure out 
the way things scale by doing some simple dimensional analysis. (If you would like to 
refresh your memory, some examples of dimensional analysis can be found in Chapter 
3 of the lectures on Dynamics and Relativity.) What does that tell us in the present 
case? 


We will measure dimension in units of inverse length. So, for example, [x] = —1 
while [(0/Oz] = +1. The quantity F[¢] must be dimensionless because it sits in the 
exponent of the partition function as e~”. The first term is 


1 
Fjo] = fate 3V9: V+... 
From this we learn that 


lg] = —— (3.9) 


Which, in turn, tells us exactly what the exponent of the correlation function must be: 
7 = 0. 


This is sobering. Dimensional analysis is one of the most basic tools that we have, 
and yet it seems to be failing at critical points where experiment is showing that n Æ 0. 
What’s going on? 


A better way to think about dimensional analysis is to think in terms of scaling. 
Suppose that we rescale all length as x — x’ = x/¢. How should other quantities 
scale so that formula remains invariant? Stated this way, it’s clear that there’s a close 
connection between dimensional analysis and RG. The correlation function (3.8) is 
telling us that we should rescale (x) > ¢'(x’) = ¢4¢6(x), where 
_ a= 2+ 
~~ 2 
This is called the scaling dimension. It differs from the naive “engineering dimension” 


Ag (3.10) 


lọ] by the extra term 7/2 which is referred to as the anomalous dimension. 


We still haven’t explained why the scaling dimension differs from engineering dimen- 
sion. The culprit turns out to be the third step of the RG procedure (3.6) where the 
field @ gets rescaled. In real space, this is viewed as coarse-graining ¢@ over blocks of 
larger and larger size a. As we do so, it dresses ¢ with this UV cut-off scale A ~ 1/a, 
often in a complicated and non-intuitive way. This means that the correlation function 
(3.8) is actually 


a” 


(6()6(0)) ~ Se 
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which is in full agreement with naive dimensional analysis. We can work with usual 
engineering dimensions if we keep track of this microscopic distance scale a. But it is 
much more useful to absorb this into ¢ and think of a coarse-grained observable, with 
dimension Ag, that is the appropriate for measuring long distance correlations. 


3.2.1 Critical Exponents Revisited 


The critical exponents that we met in Section 1.2.3 are all a consequence of scale 
invariance, and dimensional analysis based on the scaling dimension. Let’s see how 
this arises. 


We know that as we move away from the critical point by turning on u? ~ T — T,, 
we introduce a new length scale into the problem. This is the correlation length, given 
by 


|T — Tl 


~t” ith t= 
E wi T, 


(3.11) 


Here t is called the reduced temperature, while v is another critical exponent that we 
will ultimately have to calculate. Since € is a length scale, it transforms simply as 
€ — &/¢. In other words, it has scaling dimension Ag = —1. The meaning of the 
critical exponent v is that the reduced temperature scales as t > Ct, with 


il 
=> .12 
A ; (3.12) 


In what follows, our only assumption is that the correlation length € is the only length 
scale that plays any role. 


We start with the thermodynamic free energy, Finermo(t), evaluated at B = 0. This 
takes the form 


Finermo(t) = f dis f(t) 


Because Fnermo is scale invariant at the fixed point, f(t) must have scaling dimension 
d, which immediately tells us that 


f(t) 


There is an intuitive way to understand this. At T close to T}, the spins are correlated 
over distances scales €, and can be viewed as moving as one coherent block. The free 
energy Finermo is extensive, and so naturally scales as F ~ (L/€)4 ~ t”. 
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From the thermodynamic free energy, we can compute the singular contribution to 
the heat capacity near t = 0. It is 


where the second relationship is there to remind us that we already had a name for the 
critical exponent related to heat capacity. We learn that 


a=2—-—dy (3.13) 
This is called the Josephson relation or, alternatively, the hyperscaling relation. 


The next critical exponent on the list is 6. Recall that this relates the magnetisa- 
tion in the ordered phase — which we used to call m and have now called ¢ — to the 
temperature as 


gmt? 
But the scaling dimensions of this equation only work if we have 


(d—2+n)v 
2 


Ag = BA: > B vg (3.14) 


The next two critical exponents require us to move away from the critical point by 
turning on a magnetic field B. This is achieved through the addition of a linear term 
f d‘z Bọ in the free energy. (We didn’t include such a linear term in our previous 
discussion of RG, but it can be added without changing the essence of the story.) The 
scaling dimensions of this term must add to zero, giving 


d+2-n 


AptAs=d > Apa—s 


Now we can look at the various relationships. The behaviour of the susceptibility near 
the critical point is 


me 
X= B |y 


E 
Once again, the scaling dimensions are enough to fix y to be 
Ay- Agne > y=r(2—n) (3.15) 


which is sometimes called Fisher’s identity. Once again, there is an intuitive way to 
understand this. The meaning of € is that the spins are no longer correlated at distances 
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r > &. This can be seen, for example, in our original formula (2.29). Using our earlier 
expression (2.28) for the susceptibility, we have 


i 1 
- d aa BOW py po HRT) 
X / d°x aT £ t 
which again gives y = v(2 — 7). 


The final critical exponent relates the magnetisation @ to the magnetic field B when 
we sit at the critical temperature t = 0. It should come as little surprise by now to 
learn that this is again fixed by scaling analysis 
We end up with four equations, relating a (3.13), 6 (3.14), y (3.15) and 6 (3.16) to the 
critical exponents 7 and v. For convenience, let’s recall what values we claimed this 


on B? > ô (3.16) 


exponents take: 


a b y n V 
—d 
MF | 5 j 1 0 4 
d=2). 0 ; i 15 i 1 
d=3 | 0.1101 0.3264 1.2371 4.7898 0.0363 0.6300 


where we’ve used the result (2.14), including quadratic fluctuations, for the mean field 
value of a. We see that the relations are satisfied exactly for d = 2 and to within the 
accuracy stated for d = 3. However, there’s a wrinkle because they only agree with the 
mean field values when d = 4! 


This latter point is an annoying subtlety and will be explained in Section 3.3.2. Our 
main task is to understand why the mean field values don’t agree with experiment when 
d <A. 


3.2.2 The Relevance of Scaling 


The kind of dimensional analysis above also determines whether a given interaction is 
relevant, irrelevant or marginal. 


Consider an interaction term O(x) in the free energy, 
Flø ~ f ate go. 0% (3.17) 


Here O can be ¢” or ¢™(V¢)? or any of the other infinite possibilities. In a spillover from 
quantum field theory, the different interaction terms O(x) are referred to as operators. 
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We’re interested in operators which, in the vicinity of a given point, transform simply 
under RG. Specifically, suppose that, under the rescaling x > x’ = x/¢, the operator 
has a well defined scaling dimension, transforming as 


O(x) + O'(x') = C4°O(x) (3.18) 


You can think of such operators as eigenstates of the RG process. From the free energy 
(3.17), the scaling dimension of the coupling is 


Aj=d—Ao 


Under an RG flow, these couplings scale as go > C4 4°go. We can see immediately 
that go either diverges or vanishes as we push forwards with the RG. Invoking our 
previous classification, O is: 


e Relevant if Ago < d 
e Irrelevant if Ao > d 
e Marginal if Ag =d 


The tricky part of the story is that it’s not always easy to identify the operators O 
which have the nice scaling property (3.18). As we’ll see in the examples below, these 
are typically complicated linear combinations of the operators ¢” and $"(V¢@)? and so 
on. 


3.3 The Gaussian Fixed Point 


It’s now time to start calculating. We will start by sitting at a special point in theory 
space and enacting the renormalisation group. At this special point, only two quadratic 
terms are turned on: 


1 1 gl 
ld = fate [ive vorge] =f EE taada 819 


where we’ve added a subscript to the coefficient yi? in anticipation the fact that this 
quantity will subsequently change under RG flow. 


Because the free energy is quadratic in œ, it has the property that there is no mixing 
between the short and long wavelength modes, and so factorises as 


Fole] = Fold] + Fole*] 


6 /.= 


Integrating over the short wavelength modes is now easy, and results in an overall 
constant in the partition function 


e- F'I97] = J Dore Pi" e Fol? | = Ne Fold] 


This constant M doesn’t change any physics; it just drops out when we differentiate 
log Z to compute correlation functions. However, we’re not yet done with the our RG; 
we still need to do the rescaling 


k’=C¢k and y =C" or (3.20) 


where w is constant that we will determine. Written in terms of the rescaled momenta, 
we have 


Aea d 
Folo] = f (27)4 5 (k ls HOOR Px 


A d¢k! 1 k2 
= apaa (E tA) Ott 


We can put this back in the form we started with if we take 


d+2 
w = —— 
2 


(3.21) 


leaving us with 


Fig] = J EE Lae + wee 
ol? | = (r)i 2 H k?-k 
The only price that we’ve paid for this is that the coefficient of quadratic term has 
become 


H’(C) = C° uo (3.22) 


This illustrates how the length scales in the problem transform under RG. Recall that 
the correlation length (2.22) is £? ~ 1/y?. We see that, under an RG procedure, 

4 

g=? 

Ç 
The fixed points obey du? /d¢ = 0. As we anticipated previously, there are two of them. 
The first is u? = oo. This corresponds to a state which has infinite temperature. It is 
not where our interest lies. The other fixed point is at 4? = 0. This is known as the 
Gaussian fixed point. 
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3.3.1 In the Vicinity of the Fixed Point 


As we mentioned previously, we would like to classify fixed points by thinking about 
what happens when you sit near them. Do you flow into the fixed point, or get pushed 
away? 


We already have the answer to this question in one direction in coupling space. If 
we add the term p?¢?, the scaling (3.22) tells us that u? gets bigger as we flow towards 
the infra-red. This is an example of a relevant coupling: turning it on pushes us away 
from the fixed point. 


Here is another example: it is simple to repeat the steps above including the term 
ao(V7¢)? in the free energy. Upon RG, this coupling flows as a(¢) = ¢~ag. It is an 
example of an irrelevant coupling, one which becomes less important as we flow towards 
the infra-red. 


More interesting are the slew of possible couplings of the form 


Fie = f as 


1 1 > 
ZVO: Vo + sued? +Y Iont" (3.23) 
n=4 


where, to keep the Zo symmetry, we restrict the sum to n even. Here things are a 
little more subtle because, once we turn these couplings, on the first step of the RG 
procedure is no longer so simple. Integrating out the short distance modes will shift 
each of these couplings, 


Jon > In = Gon + Ogn 


We will learn how to calculate the 6g, in section 3.4. But, for now, let’s ignore this 
effect and concentrate on the second and third parts of the RG procedure, in which we 
rescale lengths and fields as in (3.20). In this approximation, the operators ¢” enjoy 
the nice scaling property (3.18), 


x =x/¢ and g'(x) = C4*4(x) 
The free energy is then rescaled by 


F(¢’ — pee ee jews ; V'¢! ai T Ae S> gon"? g"” 


n=4 
To restore the coefficient of the gradient term, we pick the scaling dimension 


_d-2 
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Figure 24: RG flows when d < 4 Figure 25: RG flows when d > 4 


For once, the scaling dimension coincides with the engineering dimension (3.9): Ay = 
lọ]. This is because we’re looking at a particularly simple fixed point. Note that this 
is related to our earlier result (3.21) by Ag = d — w, with the extra factor of d coming 
from the f d¢k in the definition of the Fourier transform. 


Our free energy now takes the same form as before, 
1 1 2 
F / = d-t z IAI ‘ Vat _ 2 12 In 
(9 fa v ive VE + OA + do mlO 


where 


MIO = At aC n (3.24) 


We see that the way these coupling scale depends on the dimension d. For example, 
the coefficient for ¢* scales as 


gal) = 64" go,4 


We learn that ¢* is irrelevant for d > 4 and is relevant for d < 4. According to the 
analysis above, when d = 4, we have g4(¢) = go,4 and the coupling is marginal. In this 
case, however, we need to work a little harder because the leading contribution to the 
scaling will come from the corrections dg, that we neglected. We’ll look at this in the 
next section. 


Restricting to the plane of couplings parameterised by u? and g4, we see that (if we 
neglect the interactions) the RG flow near the origin is very different when d > 4 and 
d < 4. These are shown in the two figures. In the former case, we need to tune only 
u? ~ T — T, if we want to hit the fixed point; the other couplings will take care of 
themselves. In contrast, when d < 4 both of these couplings are relevant. This means 
that we would need to tune both to zero if we want to hit the Gaussian fixed point. 
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We can tally this with our discussion in Section 3.2. The fact that the scaling 
dimension A, coincides with the naive engineering dimension [¢] immediately tells us 
that 7 = 0. Meanwhile, the scaling of u? ~ t is given by A; = [g2] = 2, which tells us 
that v = 1/2. From this we can use (3.13) - (3.16) to extract the remaining critical 
exponents. These agree with mean field for d = 4, but not for d < 4. (We will address 
the situation in d > 4 in Section 3.3.2.) 


It is no coincidence that this behaviour switches at d = 4, which we previously 
identified as the upper critical dimension. In an experiment, one can always change u? 
by varying the temperature. However, one may not have control over the ¢* couplings 
which typically correspond to some complicated microscopic property of the system. If 
¢* is irrelevant, we don’t care: the system will drive itself to the Gaussian fixed point. 
In contrast, if 4 is relevant the system will drive itself elsewhere. This is why we don’t 
measure mean field values for the critical exponents: these are the critical exponents 
of the Gaussian fixed point. 


The coupling for ¢° scales as 


gel) = C G0,6 


This is irrelevant in d > 3, relevant in d < 3 and, naively, marginal in d = 3. 
Note that in dimension d = 2 all of the couplings gno” are relevant. 


So far, this all looks rather trivial. However, things become much more interesting 
at other fixed points. For example, around most fixed points Agn # nA@. Indeed, 
around most fixed points neither ¢ nor ọ” will have well defined scaling dimension; 
instead those operators to which one can assign a scaling dimension consist of some 
complicated linear combination of the ø”. We will start to understand this better in 
Section 3.4. 


The Meaning of Mean Field 


The meaning of the phrase “mean field theory” has evolved as these lectures have 
progressed. We started in Section 1.1.2 by introducing mean field as a somewhat dodgy 
approximation to the partition function. Subsequently, we used the expression “mean 
field theory” to mean writing down a free energy F[¢ġ] and focussing on the saddle point 
equations. This saddle point is a good approximation to the partition function only 
when the couplings are small; this is true only in the vicinity of the Gaussian fixed 
point. For this reason, using mean field theory is usually synonymous with working at 
the Gaussian fixed point, and ignoring the effect of operators like ¢* on fluctuations. 
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Figure 26: The phase diagram of the Figure 27: The phase diagram of the 
Ising model. liquid-gas system. 


(Of course, mean field still retains the øt term in the ordered phase, where it is needed 
to stabilise the potential.) 


Interactions that Break Z Symmetry 


Until now, we have have restricted ourselves to interactions ¢” with n even, to zealously 
safeguard the Zə symmetry ¢ > —@. One particularly nice aspect of RG is that if we 
restrict ourselves to a class of free energies that obey a certain symmetry, then we will 
remain in that class under RG. We’ll see examples of this in Section 3.4. 


However, suppose that we sit outside of this class and turn on interactions ¢” with n 
odd. The leading order effect is the magnetic field Bọ that we included in our original 
Ising model. This is always a relevant interaction. This means that if we want to hit 
the critical point, we must tune this to zero. 


It may be more natural to tune B = 0 in some systems that others. For example, 
a magnet in the Ising class automatically has B = 0 unless you choose to submit it to 
a background magnetic field. This means that it’s easy to hit the critical point: just 
heat up a magnetic and it will exhibit a second order phase transition. 


In contrast, in the liquid gas system, setting “B = 0” is less natural. Unlike in the 
Ising model, there is no Z2 symmetry manifest in the microscopic physics of gases. 
Instead, it is an emergent symmetry which relates the density of liquid and gas states 
at the phase transition. Correspondingly, if we simply take a liquid and heat it up then 
we’re most likely to encounter a first order transition, or no transition at all. If we want 
to hit the critical point, we must now tune the two relevant operators: temperature u? 
and pressure, which corresponds to the linear term with coefficient B. 


In both situations above we really need to tune two relevant couplings to zero to hit 
the critical point. Of these, one is even under Z, and one is odd under Z. Doing this 


= 2 = 


will allow us to hit a fixed point with two relevant deformations, one even one odd. 
This is the Gaussian fixed point in d > 4 and is something else (to be described below) 
ind < 4. 


What about higher order interactions 6” with n odd. If we have to tune ¢, do we not 
also need to tune ¢?? It turns out that that the ¢° interaction is redundant. If you have 
a free energy with no Zə symmetry, and all powers of 6”, then you can always redefine 
your field as 6 + @+ for some constant c. This freedom allows you to eliminate 
the ¢° term. Note that if your free energy enjoys the Za symmetry ¢ > —¢ọ then it 
prohibits you from making this shift. 


3.3.2 Dangerously Irrelevant 


We’ve learned that the ¢* interaction is irrelevant for d > 4, and so one can hit the 
Gaussian fixed point by tuning just one parameter: u? = 0. 


However, there’s one tricky issue that we haven’t yet explained: the mean field 
exponents agree with the scaling analysis of Section 3.2 only when d = 4. Comparing 
the two results, we have 


V 
MF 4 3 1 3 0 i4 
Scaling td 12 1 d2 0 5 


where we’ve used the result (2.14), including quadratic fluctuations, for the mean field 
value of a. This agrees with the scaling analysis. However, for d > 4, the exponents 
6 and ô differ. It turns out that the results from Landau mean field are correct, and 
those from the scaling analysis are wrong. Why? 


To understand this, let’s recall our scaling argument from Section 3.2. We set B = 0 
and focus on the critical exponent 6. The magnetisation scales with the temperature 
t= |T —T,|/T. as 


m~ t? 
Here m is identified with the scalar field ¢. Scaling analysis gives Ay = GA;. But both 


mean field and scaling analysis agree that A; = 1/v = 2 and Ag = (d — 2)/2, and this 
gives 6 = (d — 2)/4, rather than the mean field result 6 = 1/2. 
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However, we were a little quick in the scaling analysis because we neglected the 
quartic coupling g4. Mean field really told us (1.31), 


G M 
Mg ~ = 
g4 


But both t and g4 scale under RG flow. The scaling dimension of g4 is A,, = 4—d and 
now the mean field result, with 8 = 1/2 is fully compatible with scaling. 


There’s a more general lesson to take from this. It is tempting, when doing RG, 
to think that we can just neglect the irrelevant operators because their coefficients 
flow to zero as we approach the infra-red. However, sometimes we will be interesting 
in quantities — such as the magnetisation above — which have the irrelevant coupling 
constants sitting in the denominator. In this case, one cannot just blindly ignore 
these irrelevant couplings as they affect the scaling analysis. When this happens, the 
irrelevant coupling is referred to as dangerously irrelevant. 


3.3.3 An Aside: The Emergence of Rotational Symmetry 


This is a good point to revisit an issue that we previously swept under the rug. We 
started our discussion with a lattice model, but very quickly moved to the continuum, 
field theory. Along the way we stated, without proof, that we expect the long distance 
physics to enjoy rotational invariance and we restricted our attention to field theories 
with this property. Why are we allowed to do this? 


To make the discussion concrete, consider a square lattice in d = 2 dimensions. This 
has a discrete Z4 rotational symmetry, together with a Zə reflection symmetry. These 
combine together into the dihedral group Dg. 


Our field theory description will respect the Dg symmetry of the underlying lattice 
model, together with the Zə symmetry ¢ — —@ which ensures that fields come in pairs. 
But this would appear to be much less powerful than the full O(2) continuous rotation 
and reflection symmetry. Have we cheated? 


Let’s see what kind of terms we might expect. First, there are some simple terms 
that are prohibited by the dihedral symmetry. For example a lone term (0,¢)? would 
break the 7; — 22 discrete rotational symmetry and so would not appear in the free 
energy. Similarly, a term ¢0,¢ breaks the zı —> —2z, symmetry. (On top of this, it 
is also a total derivative and so doesn’t contribute to the free energy.) The lowest 
dimension term that includes derivatives and is compatible with the discrete symmetry 
is 


Oz ~ (31b)? + (026)? 
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But this term happens to be invariant under the full, continuous O(2) rotational sym- 
metry. We should keep going. The first term that preserves Dg, but not SO(2), is 


O, ~ b010 + G23 


There is no reason not to add such terms to the free energy and, in general, we expect 
that these will be present in any field theoretic description that accurately describes 
the microscopic physics. However, this operator has dimension Ao, = d+ 2 and so 
is irrelevant. This means that it gets washed away by the renormalisation group, and 
the long wavelength physics is invariant under the full O(2) symmetry. We say that 
the continuous rotational symmetry is emergent. A similar argument holds for higher 
dimensions. 


3.4 RG with Interactions 


The previous section left two questions hanging. What happens to the renormalisation 
of the coupling g4 in d = 4 dimensions? And where does the flow of g4 take us in d < 4 
dimensions? In this section we will answer the first of these. In Section 3.5 we will see 
that our analysis also contains the answer to the second. 


We now repeat our RG procedure, but with a different starting point in theory space, 
1 1 
Fld) = fes 3v0 “Vet arot + goo" 


The renormalisation group procedure tells us to split the Fourier modes of the field into 
long and short wavelengths, 


Pk = pk + OE (3.25) 
and write the free energy as 
Fd] = Folo] + Folo] + Frio, 67] 


where we take Fo[@] to coincide with the quadratic terms (3.19), and the interaction 
terms are 


Filo] = jes gob" 


Note that we’ve chosen to include, for example, (¢~)* in the interaction terms rather 
than Fo. This is a matter of convention. 
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The effective free energy for ,, defined in (3.4), is given by 


e- F'I6-] — e-Folek] f Dot e Poletle-Frlók 4 


There is a nice interpretation of this functional integral f Dor. We can think of it 
as computing the expectation value of e7*" [be OK |, treating Qi; as the random variable 
with Gaussian distribution e~/>l%]. In other words, we can write this as 


e-F'le] — e- Foley (e-Fil6. 641) 
+ 


where the subscript on (-); is there to remind us that we are averaging over the o} 
modes only. We take the definition of the path integral to be suitably normalised so 
that (1), = 1. Taking the log of both sides, 


F'[¢-] = Fo[d] — log (PRA (3.26) 


Our task is to compute this. 


We can’t do this functional integral exactly. Instead, we resort to perturbation 
theory. We assume that go is suitably small, and expand. (The dimensionless small 
parameter is goui4~*.) We first Taylor expand the exponential, 


z 1 
log (e~Fil#i#) =log (1 - Fr + 5 FP +...) 
+ 2 a 
and we then Taylor expand log(1 + x), to get 


log eel) = — (Fr), + ; IF?) — (Fr); | +... (3.27) 


where, in general, the n“ term is (—1)” x nt? cumulant of Fr. This also follows from 
the same kind of manipulations that we did at the beginning of Section 2.2. We will 
deal with each of terms above in turn. 


3.4.1 Order go 


At leading order in go, we need to compute (F HOME of]) be The first order of business 
is to expand the interaction terms (3.26) in Fourier modes. We have 


Filok, O¢] = Go (terms with œ) x (27)464 oe k;) 


There are five different “terms with @”, most of which do not give anything interesting. 
These five terms are: 
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i) Pq Pko Pk, Pk,: This term doesn’t include any oy, the average is trivial. It carries 
over to give the term go f dfx ¢~* in the effective free energy. 


ii) 4o,, Pie Peg Pica: This term has just a single ¢{ and so vanishes when averaged over 
the Gaussian ensemble. 


iii) 664, O%,%%,?x,:; This term is interesting. We will look at it more closely below. 
For now, note that the factor of 6 comes from the different choices of momentum 
labels. 


iv) 46, 04,04, %4,: This term is cubic in f and, like any term with an odd number 
of insertions, vanishes when averaged over the Gaussian ensemble. 


v) oi PPk: This term doesn’t include any $p, it simply gives constant to the 
free energy. It will not be important here. 


We learn that we need to compute just a single term, 


(Fi), = 69 I lI ii tati x (Oh), X CIIS k) (3.28) 


But this is the same kind of correlation function that we computed in Section 2.2: it is 
given by 
1 


=A 3.29 
k2 + uê (529) 


(doy), = (27) 8 (k +k')Go(k) with Golk) 
After playing around with the delta-functions, and relabelling momentum variables, 
we're left with our first correction to the free energy, 


AIC dik ^ diq 1 
F;). =6 — oe e 
Fr) w | (yd PK Ok I. (27)? q? + u 


where the limits on the f dq integral reflect the fact that we’ve only integrated out the 
short wavelength modes, whose momenta lie within this band. We see that, at order 
go, we get a correction only to the quadratic term whose coefficient becomes 


A d 
i d“q 1 
po pl? = p2 + 1290 f (3.30) 


A/¢ (27)4 q? + u 
Finally, we should enact the rescaling steps of the renormalisation group. This takes 


the same form as before (3.20), 


d+2 
k’=Ck and Prk =O" Pre with w= = 


= TT = 


This gives the same scaling of the parameters that we saw in Section 3.3. We have 


A d%q 1 


KO = C? (12 + 1290 [. Saas) and g(¢) = 64-4 go (3.31) 


The upshot of this calculation is that turning on a ¢* coupling will give rise to a 
quadratic ¢? coupling under RG flow. This is typical of these kinds of calculations: 
couplings of one type will induce others. 


The coefficient of the ¢? term is particularly important for our story, since the critical 
point is defined to be the place where this vanishes. We see that it’s not so easy to 
make this happen. You can’t simply set uô = 0 at some high scale and expect to hit 
criticality. Indeed, the result (3.31) tells us that, at long wavelengths, the “natural” 
value is u? ~ goA%?, which is typically large. If you want to hit the critical point, you 
must “fine tune” the original coefficient uê to cancel the new terms that are generated 
by RG flow. 


You might think that this calculation answers the question of what happens to the 
theory in d = 4 when we turn on g¢ġ*. It certainly tells us that turning on this coupling 
will induce the relevant coupling p?¢? and so take us away from the Gaussian fixed 
point. However, a closer look at (3.31) reveals that it’s possible to turn on a combination 
of go and puŝ, so that u? (C) remains zero. This combination is a marginal coupling. We 
learn that, at this order, there remains one relevant and one marginal deformation. 


3.4.2 Order gå 


The corrections to the $4 terms first arise at order g. Here we have the contribution 
a 1 
Pig") ~ = 5 (CEP - (Fn?) (3.32) 


Expanding out (F?), we find 256 different terms. We will see how to organise them 
shortly, but for now we make a few comments before focussing on the term of interest. 


Some of the terms in (F?) will result in corrections that cannot be written as a local 
free energy, but are instead of the form (f d“af (¢-))’° for some f(¢). These terms will 
be cancelled by the (Fr)? terms. This is a general phenomena which you can learn more 
about in the lectures on Quantum Field Theory. In terms of Feynman diagrams, which 
we will introduce below, these kind of terms correspond to disconnected diagrams. 


= [x= 


The terms that we care about in (F?) are those which can be written as local cor- 
rections to the free energy. Of immediate interest for us are a subset of terms in 


F? ~ f d'z d*y $*(x) ¢*(y), given by 


Lee AN JA eT dee li fae) ag 
ea ea] Sel Lon Oh). (3:38) 


x (2r) 8 (kı + ko + qi + q2)d%(k3 + ka + q3 + qu) 


Let’s explain what’s going on here. Each (x) is decomposed into Fourier modes @~ and 
@*. The same is true for each ¢(y). In the above term, we have chosen two ¢~ out of 
the ¢4(x) and two ¢ out of the ¢*(y); the remaining terms are ¢*. Each combinatoric 
factor ia = 6 out front reflects the choice of picking two ¢~ from ¢*. Meanwhile, 
the two delta functions come from doing the fd‘x and f d%y integrals respectively. 


Matching the momenta in the arguments of the delta functions to the @* tells us that 
we’ve picked two @ from the ¢*(x) and two @ from ¢4(y) (as opposed to, say, all 
four from ¢4(y)). 


To proceed, we need to compute the four-point function (69, 64,0¢,¢q,)+- To do this 


we need a result known as Wick’s theorem. 


Wick’s Theorem 


As we proceed in our perturbative expansion, the integrals start to blossom. From the 
form of the expansion (3.27), we can see that the integrand will involve expectation 
values of the form (oy, Sir OK, )+. There is a simple way to compute expectation values 
of this type in Gaussian ensembles. This follows from: 


Lemma: Consider n variables ¢, drawn from a Gaussian ensemble. This means that, 
for any function f(@), the expectation value is 


(N10) = 57 fae feeders 


for some invertible n x n matrix G. The normalisation factor is M = det!’ *(27G) and 
ensures that (1) = 1. The following identity then holds: 


(gmat) = e2Ba(bade) Bo (3.34) 


for any constant B®. 
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Proof: This is straightforward to show since we can just evaluate both sides 


Baba 1 = Ny, p-aP-G1O+B- 
(e ) N d”y e 


OO 
= ral dy e723 (¢-GB)-G"($-GB) ,5B-GB _ e22GB = e3 Ba(bads) By 
— 00O 


where, in the last step, we used the fact that (pab) = Gab- 


The Taylor expansion of the identity (3.34) gives us the expressions that we want. 
The left-hand-side is 


(Pte) = 1+ Ba ($a) + 5 BeBu(adn) + = Ba BrBe( Gadde) + 7 BaBsBeBulbadudeda) + - 


Meanwhile, the right-hand-side is 


1 1 1 
e2 Palao) Bo = 1+ 5 PaBr(ba%s) + 3 Pa By B-Bal babs) (Pca) te ya 


Now we just match powers of B, on both sides. We immediately learn that 
(¢a,--- 0a) =0 for l odd 


Our real interest is in l even. Here we have to be a little careful because multiplying by 
the string of B’s automatically symmetrises the products of (Pab) over the a = 1,...,n 
indices. So, for example, comparing the B* terms gives 


(dabobcha) = (Papo) (depa) a (abe) (Popa) + (baba) (doe) (3.35) 
It’s not hard to convince yourself that 
(Pa ae Paz) = (Oui Pas) tee (Paai Pass) =H symmetrised 


This leaves us with a sum over all pairwise contractions. This result is known as Wick’s 
theorem. 


Back to the Free Energy 
We can now apply Wick’s theorem to our free energy (3.33), 


( k L a qu)+ = ( qı An a aa v ae a wet a A 3 lE 


Recall that each of these propagators comes with a delta function, 


(64 66) + = (20)75%(q + q') Gola) 
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The trick is to see how these new delta functions combine with the original delta 
functions in (3.33). There are two different structures that emerge. The first term, 
gives (ignoring factors of 27 for now) 


4 
SII d'a; (65, oh) (Ood) +8 (ki + ke + qı + q2) ô (ks + ka + qs + qa) 


j=1 


~ [eee Go(q2)Go(qa) ôt(kı + kə) ôt(ks + k4) (3.36) 


We're still left with two delta functions over the k variables. This means that when 
we go back to real space, this term does not become a local integral. Instead, if you 
follow it through, it becomes a double integral of the form (f d4x 7 (x). As we 
explained after (3.32), these terms are ultimately cancelled by corresponding terms in 
(Fr)?. They are not what interests us. 


Instead, we care about the second and third terms in the Wick expansion of (P4 09,6454) +: 


Each of them gives a contribution of the form 
4 
JU d"q; ( z H E a t,)46%(ki + ko + qi + q2) ô (ks + k4 + q3 + qa) 
j=1 


~ | tade Golqı) Golq2) ôt(kı + k2 + qı + qo) ô? (ks + k4 — qi — q2) 


~ [ea Gold) Go(|ki + ke + ql) f(k + kə + kg + ka) (3.37) 


where, in going to the last line, we have done the integral f dq. and relabelled qı = q. 
Now we have just a single delta function over k; and, correspondingly, when we go back 
to real space this will give a local contribution to the free energy. Indeed, the terms 
(3.33) now become 


neno (2) 8 [TT [ES oi] s06 +49 OVE) (8) 


where the factor of ; in (3.33) has disappeared because we get two contributions from 
the Wick expansion, each of which gives the same contribution (3.37). The remaining 
integral over f d%q is hidden in the function f(k), which is given by 


A 
f(k, +k») = f T : (3.39) 


aye (27)? q? + ug (ki + ke + q)? + HG 
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This is not as complicated as it looks. We can write it as 


f(ki + ke) = f z : (1+ O(k,, k2)) 


aye (2m)? (q? + m)? 


All the terms that depend on the external momenta k, and kg will generate terms in the 
free energy of the form k?(¢~)* ~ (¢~)?V?(¢~)?. These are irrelevant terms that will 
not be interesting for us other than to note that once we let loose the dogs of RG, we 
will no longer sit comfortably within some finite dimensional subspace of the coupling 
constants. Integrating out degrees of freedom generates all possible terms consistent 
with symmetries; flowing to the IR allows us to focus on the handful of relevant ones. 


The contribution (3.38) to the free energy is what we want. Translating back to real 
space, we learn that the quadratic term gets corrections at this order. We have 


A d 
d°q 1 
m> = m- 3648 f 


aye (27) (q? + m)? omy) 


The minus sign in (3.40) is important. It can be traced to the minus sign in (3.32), 
and it determines the fate of the would-be marginal coupling g¢* in d = 4 dimensions. 
Recall that, in d = 4, there is no contribution to the running of g(¢) from the second 
and third steps of RG. Here, the leading contribution comes from the first step and, as 
we see above, this causes g(¢) to get smaller as ¢ increases. This means that the theory 
in d = 4 is similar in spirit to those in d > 4, with ¢* an irrelevant coupling. 


However, in d = 4, the RG flow for g happens much more slowly 2 
than other couplings. For this reason it is sometimes called marginally z d=4 
irrelevant, to highlight the fact that it only failed to be marginal when 
the perturbative corrections were taken into account. This is a general g 


phenomenon: most couplings which naively appear marginal will end — os 
up becoming either marginally relevant or marginally irrelevant due a 
to such corrections. In the vast majority of cases, the coupling turns 


out to be marginally irrelevant. However, there are a number of very 
important examples — the Kondo effect and non-Abelian gauge theories 
prominent among them — where a marginal coupling turns relevant. Figure 28: 


We’ll see such an example in Section 4.3. 


Finally, just because g is marginally irrelevant in d = 4 does not mean that you can 
turn it on and expect to flow back to the Gaussian fixed point. As depicted in the 
diagram, the coupling mixes with p?. If you want to flow back to the Gaussian fixed 
point, you need to turn on a particular combination of ug and gp. 
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3.4.3 Feynman Diagrams 


The calculation above needed some care. As we go to higher order terms in the ex- 
pansion, the number of possibilities starts to blossom. Fortunately, there is a simple 
graphical formalism to keep track of what’s going on. 


Suppose that we’re interested in a term in the expansion (3.27) of the form gh (d~)"(ot)!. 
This can be represented by one or more Feynman diagrams. Here is the game: 


e Each @, is represented by an external, solid line. 
e Each ¢{ is represented by a dotted line. 


e The dotted lines are connected with each other to form internal loops. They are 
paired up in all possible ways, reflecting the pair contraction of Wick’s theorem. 
No dotted lines can be left hanging which means that the diagrams only make 
sense for l even. 


e Each factor of go is represented as a vertex at which four lines meet. 

e Each line has an attached momentum k which is conserved as we move around 
the diagram. 

Each of these diagrams is really shorthand for an integral. The dictionary is as follows: 

e Each internal line corresponds to an insertion of the propagator CAND) i defined 
in (3.29) 

e For each internal loop, there is an integral f d%q/(27)*. 

e Each vertex comes with a power of go(27)767(5>,k;) where the delta function 
imposes momentum conservation, with the convention that all momenta are in- 
coming. 

e There are a bunch of numerical coefficients known as symmetry factors. 


This means that a term in the effective action of the form g}(¢~)” will correspond to 
a diagram with n external lines and p vertices. We can see what these diagrams look 
like for some of the terms that we’ve met so far. At order go, the rules don’t allow us 
to draw diagrams with an odd number of ¢,. The term x, Oy, 0_,?,, IN the expansion 
gives the trivial contribution to ¢*. In diagrams, it is 


ky 


_ oa fae (3.41) 


i, 
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wo 


Similarly, the (¢*)* terms are DK, Pk Pies Plea Oo sia ' but Hecapo these are internal 


lines we should join them up to get a diagram that looks like | 


The interesting term at order go is x, PPP a: which includes both ¢, and ġ¢. 
This was evaluated in (3.28). It is represented by the diagram 


we i E = agp T aaa | (oy 


where the integral arises because the momentum q of the ¢* excitation running in the 
loop is not determined by momentum conservation of the external legs. Such corrections 
are said to arise from loop diagrams, as opposed to the tree diagrams of (3.41). 


Finally, at order gZ, the correction (3.38) to the ¢* coupling comes from the 


k+k,+q 
kı wns ky 
W a oE | ak asd 
aK =ef TEE oe] Fs tka (2k) 
kA E Nk 0 i=1 i 


where the function f(k +k.) was given in (3.39) and contains two propagators associ- 
ated to the ¢* fields running in the loop. Evaluating this leads to the correction (3.40), 
as well as a slew of higher derivative couplings. 


Any graph that we can draw will appear somewhere in the expansion of log(e~/!%« i Ma. 
given in (3.27). As we noted above, this is a cumulant expansion which has the rather 
nice graphical interpretation that only connected graphs appear in the expansion. For 
example, at order gê, the disconnected graph that appears in the expansion of (F?) 4 
that looks like K K will be cancelled by the same disconnected graph appearing in 
=(P 


In the language of quantum field theory, it’s tempting to view the lines in the Feyn- 
man diagrams as the worldlines of particles. There is no such interpretation in the 
present case: they’re simply useful. 

More Diagrams 
We can now also look at other diagrams and see what role they play. For example, 


you might be worried about the diagram ——~- "a This is strictly vanishing, because 
os 
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the incoming momentum for lone ¢@ leg is forced to be equal to the intermediate 
momentum for the ¢* propagator. Yet the momentum for ¢~ and ¢* can never be 
equal . 


There are, however, two further diagrams that we neglected which do have an inter- 
esting role to play. Both of these are two loop diagrams. They will not affect what 
we're going to do later but, nonetheless, are worth highlighting. The first diagram is 


, A R dik 1 o 
S I f On 5 CUA) OP 
for some C(A) whose exact form will not be important. This gives rise to a shift in the 
quadratic term, so that (3.30) is replaced by 


A d 
d°q 1 
m >n? = p+ 1290 f 


Sk EEES o 
aera AN 


The second diagram is 


a > f dk 1 Ea 

a f r)i glk Apk One (3.42) 
We’ve called the result of this diagram A(k, A); again we won’t need its detailed form. 
Importantly, it is now a function of the external momentum k. This means that it gives 
rise to two effects that are (in a technical sense) relevant. The first is that there is yet 
another renormalisation of u?, this time depending on A(0,A). The second is novel: 
upon Taylor expanding A(k) = A(0) + k? A” (0) +..., we get a contribution to the the 
gradient term, which is now 


1 
Flø]= f dle 5'Vo-Vo+... with y =1-2924"0,A) 


This, in turn, means that we need a new rescaling of the field. To order gg, we can 
write this as 

Cae E 
1 — gA" (0, A) Pre 
This last, additional step is known as field renormalisation. (Actually, that’s not com- 
pletely true. It should be known as “field renormalisation”, but instead is known as 


kK =ck: and ġķ= 


“wavefunction renormalisation”. This a terrible name, one that betrays the long and 
deep confusion that permeated the origins of this subject. Even in the context of quan- 
tum field theory, this rescaling has nothing to do with wavefunctions. It is a rescaling 
of fields!) 
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Although we won’t compute this field renormalisation exactly, it is nonetheless im- 
portant for this is what gives rise to the anomalous dimension of ¢, and this was 
underlying the whole scaling analysis of Section 3.2. 


3.4.4 Beta Functions 


It is useful to write down equations which describe the flow of the coupling constants. 
These are first order differential equations which, for historic reasons are known as beta 
functions. It turns out to be convenient to parameterise the change in cut-off as 


N =-= Ae’ 
Ç 
The renormalisation group transformation described above tells us that each coupling 
changes with scale, gn = gn(s). The beta function is defined as 


dgn 


FA = Bn(gn) 


Note that s increases as we flow towards the IR. This means that a positive beta 
function tells us that g, gets stronger in the IR, while a negative beta function means 
that gn gets weaker in the IR. (As an aside: this is the opposite to how beta functions 
are sometimes defined in quantum field theory, where one parameterises the flow in 
terms of energy rather than length.) 


Before we jump straight in, it’s useful to take a step backwards and build up the 
beta functions. Let’s go back to our original scaling analysis around the Gaussian fixed 
point (3.24), where the running of the couplings is given by g,(s) = e¢-"4/2+™)8 gon. 
The beta functions are 

dgn 1 

— = |d- -nd+n | gn 3.43 

ds ( 2 ) 7 ( ) 
Notice that, at this leading order, there’s no mixing between different couplings: turning 
on one coupling gn does not induce another to flow. As we saw above, this state of 
affairs no longer holds when we include interactions. 


We now focus on the two most important couplings, 2 and go. At order go, the RG 
equations are given by (3.31); the additional correction at order gj, given in (3.40), 
means that these get replaced by 


RC) =G? (utag) and g(¢) =C lgo — bgi) (3.44) 
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where 


A d A d 
d 1 d 1 
ai — and b= 36 | a — 
aye (277) 4 q? + MG aye (27) (q? + uô) 


Note that we have kept our original scaling dimensions in (3.44); the corrections in 
scaling due to the diagram (3.42) will be subleading and not needed in what follows. 


When we differentiate u? (C) and g(¢) to derive the beta functions, we will get two 
terms: the first is (3.43) and comes from the scaling; the second comes from the cor- 
rections, given by the integrals a and b. Differentiating these integrals is particularly 
easy. For small s, we we write 


A d A 
f dq OSA- AIFA AFA > © | def) =A 
A A 


es ds e-s 
Let’s restrict to d = 4 dimensions. The beta function equations are 


dy? > 3g A dg 9 A 5 
ns d == 3.45 
ds Qn? A? + p? and Js Qn? (A? + yet ee) 


These don’t (yet) contain any new physics, but it’s worth reiterating what information 
we can extract from these equations. 


First, the beta function for u? has two terms; the first term comes from the second 
and third steps in RG (scaling), while the second comes from the first step in RG 
(integrating out). Meanwhile, the beta function for g has only a single term. There 
is no term linear in g because it was marginal under scaling, but it does receive a 
contribution when we integrate out the high momentum modes at order g?. This 
contribution is negative, which tells us the coupling is marginally irrelevant. (A repeat 
of the warning above: this is the opposite convention to quantum field theory where one 
flows in decreasing energy, rather than increasing length, which means that a marginally 
irrelevant interaction is usually said to have a positive beta function. ) 


3.4.5 Again, the Analogy with Quantum Field Theory 


The calculations above are very similar to the kind of loop integrals that you do in 
quantum field theory in d = 3+ 1 dimensions. There are, however, some philosophical 
differences between the approaches. 


= 37 = 


In statistical mechanics, the field ¢(x) is, by construction, a coarse grained object: 
at the microscopic level, it dissolves into constituent parts, whether spins or atoms or 
something else. This has the practical advantage that we have no expectation that 
the statistical field theory will describe physics on arbitrarily short distance scales. 
In contrast, when we talk about quantum field theory in the context of high energy 
physics, it is tempting to think of the fields as “fundamental”, a basic building block 
of our Universe. We may then wish for the theory to make sense down to arbitrarily 
small distance scales. 


This ambition leads to a subtly different viewpoint on renormalisation. In quantum 
field theory one must introduce a cut-off, as we have above, to render integrals finite. 
However, this cut-off is very often viewed as an artefact, one which we would ultimately 
like to get rid of and make sense of the theory as A —> oo. The trouble is that the 
renormalised quantities — things that we’ve called u? and g — typically depend on this 
cut-off. We saw this, for example, in (3.44). Often this makes it tricky to take the limit 
A> oo. 


To avoid this problem, one makes the so-called bare couplings — things we’ve called 
uó and go — depend on A. This is not such a dumb thing to do; after all, these quantities 
were defined at the cut-off scale A. The original game of renormalisation was to find a 
way to pick y2(A) and go(A) such that all physical quantities remain finite as A > oo. 
It is by no means obvious that this is possible. Theories which can be rendered finite 
in this way are said to be renormalisable. 


The high-energy approach to renormalisation predates the statistical physics ap- 
proach and is now considered rather old-fashioned. The idea that a theory needs to 
make sense up to arbitrarily high energy scales smacks of hubris. The right way to view 
renormalisation — whether in statistical mechanics or in high energy physics — is through 
the renormalisation group procedure that has been our main focus in this chapter, in 
which one integrates out short wavelength modes to leave an effective long-distance 
theory. 


Nonetheless, the high-energy approach to renormalisation has its advantages. Once 
one goes beyond the calculations described above, things are substantially easier with 
a high-energy viewpoint. You will learn more about these issues in the lectures on 
Advanced Quantum Field Theory. 

3.5 The Epsilon Expansion 


We have learned that ¢* interaction is irrelevant for d > 4 and relevant for d < 4, 
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Figure 29: The beta function for g when d < 4. The arrows show the flow of g as we move 
towards the infra-red. The fixed point lies within the regime of perturbation theory when 
d=4-e. 


sweeping us away from the Gaussian fixed point. But we seem to be no closer to 
figuring out where we end up. All we know is that we’re not in Kansas anymore. 


The difficulty is that we’re limited in what we can calculate. We can’t do the path 
integral exactly in the presence of ¢* interactions, and are forced to work perturbatively 
in the coupling g. Yet, as we have seen, in dimension d < 4 the RG flow increases g, 
taking us to a regime where perturbation theory is no longer valid. 


Nonetheless, the calculations that we did above do contain information about where 
we might expect to end up. But to see it, we have to do something rather dramatic. 
We will consider our theories not in d = 4 dimensions, but in 


d=4-—e 


dimensions where € is a small number, much less than 1. Clearly, this is an odd thing to 
do. You could view it as an act of wild creativity or one of utter desperation. Probably 
it is a little bit of both. But, as we shall see, it will give us the insight we need to 
understand critical phenomena. 


First, we should ask whether it makes sense to work in a non-integer dimension. 
The lattice models that we started with surely need to be defined in dimension d € Z+. 
Similarly, it was important for us that the free energy is local, meaning that it is written 
as an integral over space, and this too requires d € Zt. However, by the time we get 
to the beta function equations, it makes mathematical, if not physical, sense to relax 
this and work in arbitrary d € R. We can read off these beta functions from the RG 
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equations (3.44): they are 


du? o F 120) 4-1 Af oar 
ds" T Qn? Beer” 
dg. 36041 At 


= = ~2 | 
as = €g (Qn) (A? + 224 ee 


where Qg is the area of the sphere S41. We’ve introduced the dimensionless coupling 
g = Afg. Note that the beta function for g now includes a term linear in g arising 
from the scaling. 


It can be checked that the two-loop diagrams we neglected contribute only at order 
e. This means that it’s consistent to truncate to the beta functions above. We could 
use the general formula for the area of a sphere, Qg_1 = 274/?/T'(d/2), but this will give 
corrections of order e°, so instead we simply use Q3 = 27”. Similar comments apply to 
(27). We're left with 


du? 2 A 

a 2p g 
ds Qn? A? + u? 
gj 9 MM 


ds T On (A2? + meS 
The novelty of these beta functions is that they have two fixed points. There is the 
Gaussian fixed point u? = g = 0 that we discussed before. And there is a new fixed 


point, 
3 A4 Ir? (A? he u 
2 __ ~ ~ * 
H= “7 +e and J, = 9 mS 
Since we’re working to leading order in €, the solution is 
3 7 1 Z Qn? 
i= -paN e = -74e and g, = Ea 


This is the Wilson-Fisher fixed point. Importantly, when € is small then the fixed 
point g, is also small, so our calculation is self-consistent (although, since we are in a 
fractional dimension, arguably unphysical!). 


3.5.1 The Wilson-Fisher Fixed Point 


To understand the flows in the vicinity of the new fixed point, we write u? = u? + ôu? 
and g = 9, + og. Linearising the beta functions, we find 


d fop?\ (2=e«/3 37? (1+ §) ôu? 
ds \ $4 0 =¢ Og 
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where, as with all our other calculations, the entries in the matrix hold only up to 


O(e?). 


The eigenvalues of a triangular matrix coincide with the 
diagonal entries. We see that this fixed point has one positive 
and one negative eigenvalue, 


A, =2- 5 + O(c?) and A, = —-e+ O(c?) 


In other words, the Wilson-Fisher fixed point has one relevant 


direction and one irrelevant. The flows are shown in the figure. 


: ; : f Fi 30: 
We see that the epsilon expansion provides us with a global bai 


picture of the RG flows in d < 4 dimensions. One can check 
that all other couplings are also irrelevant at the Wilson-Fisher fixed point. Crucially, 
the fixed point sits at small g where our perturbative analysis is valid. 


Now suppose that we increase e. The Wilson-Fisher fixed point moves to higher g, 
and our perturbative approach breaks down. Nonetheless, it is not unreasonable to 
suppose that the qualitative picture of the flows remains the same. Indeed, this is 
thought to happen. Because the Wilson-Fisher fixed point has just a single relevant 
operator, it means that we will generically end up there if we we’re willing to tune just 
a single parameter, namely T —> T}. 


It is now a simple matter to compute the critical exponents in the epsilon expansion. 
Recall from Section 3.2 that these are related to the scaling dimensions of various 
terms. The relevant direction away from the Wilson-Fisher fixed point is temperature, 
t = |T — T,|/T.. Its dimension is determined by the way it scales as we approach the 
critical point, t > ¢4*t = e®“*t. But this is precisely the eigenvalue A; that we just 
computed. 


The critical exponent v, defined by € ~ t~”, is then given by (3.12) 


We can then use the hyperscaling relation a = 2 — dv, given in (3.13), to compute the 
critical exponent for the heat capacity 


c~t? with a= > + O(c?) 
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To compute the other critical exponents, we need to evaluate the anomalous dimension 
n. As we mentioned briefly above, this is related to the diagram = — and turns 
out to be 7 = €?/6, which is higher order in the expansion. We then have, from (3.10), 


d—2 € 
Age ——=1-5 


Equations (3.14), (3.15) and (3.16) then give 


1 € € 
=-->, xi+= , x3 
53 5 y + 6 TE 
where all expressions hold up to corrections of order O(e). 


Of course, our real interest lies in the system at d = 3, or e = 1. It would be in poor 
taste to simply plug in € = 1. But I know that you’re curious. Here’s what we get: 


a p y ò n V 
MF 0 i 1 3 0 5 
es1) 017 O33: Liy 4 0 0.58 
d=3 | 0.1101 0.3264 1.2371 4.7898 0.0363 0.6300 


Our answers are embarrassingly close to the correct values given the dishonest method 
we used to get there. One can, however, make this approach more respectable. The e 
expansion has been carried out to order O(e°). It is not a convergent series. Nonethe- 
less, sophisticated resummation techniques can be used to make sense of it, and the 
resulting expressions give a fairly accurate account of the critical exponents. 


The real power of the epsilon expansion, however, is more qualitative than quan- 
titative; it usually — but not always — gives a reliable view of the structure of RG 
flows. 


3.5.2 What Happens in d = 2? 


We have not yet discussed much about what happens in d = 2 dimensions. Here the 
story is somewhat richer. The first hint of this can be seen in a simple analysis of the 
Gaussian fixed point, which shows that 


Ag = [¢] =0 


This means that the Gaussian fixed point has an infinite number of relevant deforma- 
tions since $”, for each n, is relevant. 
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It turns out that, in contrast to d = 3, there are actually an infinite number of 
fixed points in d = 2. Roughly speaking, the n“ fixed point can be reached from the 
Gaussian fixed point by turning on 


F[¢] = fer (Vo) T Jaina Prrr) 


Of course, as we’ve seen above, the RG flow is not quite so simple. When we turn on 
the coupling g2n41)¢7"*) we will generate all other terms, including ¢? and ¢* and 
so on. To reach the n fixed point, we should tune all such terms to zero as we flow 
towards the infra-red. 


One can show that the n fixed point has n relevant operators: schematically, these 
can be thought of as ¢7,¢*,...,¢?" although, as above, there will be mixing between 
these. By turning on the least relevant operator, one can flow from the n? fixed point 
to the (n — 1)" fixed point. 


The results above are not derived using the € = 4—d expansion which, unsurprisingly, 
is not much use in d = 2. Instead, they rely on something new which we will briefly 
describe in Section 3.6. 


3.5.3 A History of Renormalisation 


“After about a month of work [at General Atomic Corp] I was ordered to 
write up my results, as a result of which I swore to myself that I would 
choose a subject for research where it would take at least five years before 
I had anything worth writing about. Elementary particle theory seemed to 
offer the best prospects of meeting this criterion.” 


Kenneth Wilson 


Renormalisation first entered physics in the context of quantum field theory, with 
the need to make sense of the UV divergences that arise in quantum electrodynamics. 
The theory, developed by Schwinger, Feynman, Tomonaga, Dyson and others, amounts 
to finding a consistent way to subtract one infinity from another, leaving behind a finite 
answer. This method yields excellent agreement with experiment but is, in the words 
of Feynman, a “dippy process”, in which the infinities are not so much understood as 
swept under a very large rug. 
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The first hint of something deeper — and the first hint of a (semi)-group action — was 
seen in the work of Gell-Mann and Low in 1954. They realised that one could define 
an effective charge of the electron, e(j) where u denotes the energy scale at which the 
experiment takes place. This interpolates between the physical charge, as u — 0, and 
the so called bare charge at high energies. 


Meanwhile, throughout the 50s and 60s, a rather different community of physicists 
were struggling to understand second order phase transitions. It had long been known 
that Landau theory fails at critical points, but it was far from clear how to make 
progress, and the results that we’ve described in this course took several decades to 
uncover. In Kings College London, a group led by Cyril Domb stressed the importance 
of focussing on critical exponents; at Cornell University, Benjamin Widom showed 
that the relationships between critical exponents could be derived by invoking a scale 
invariance, albeit with little understanding of where such scale invariance came from; 
and at the University of Illinois, Leo Kadanoff introduced the idea of “blocking” in 
lattice models, a real-space renormalisation group in which one worked with successively 
coarser lattice models. 


While many people contributed to these developments, the big picture, linking ideas 
from particle physics, statistical physics and condensed matter physics, is due mostly 
to... 


Kenneth Wilson: 1936-2013 


Ken Wilson received his PhD in 1961, working with Murray Gell-Mann on an assort- 
ment of topics in particle physics that fed his interest in the renormalisation group. He 
went on to spend much of the 1960s confused, scrabbling to understand the physics of 
scale, first in quantum field theory and later in the context of critical phenomena. He 
wrote very few papers in this time, but his reputation was strong enough to land him 
postdocs at Harvard and CERN and later even tenure at Cornell. (Some career advice 
for students: the strategy of being a genius and not writing anything rarely leads to 
such success. ) 


The floodgates opened in 1971 when Wilson set out his grand vision of the renormal- 
isation group and, with his colleague Michael Fisher, suggested the epsilon expansion 
as a perturbative method to compute critical exponents in a paper charmingly titled 
“Critical Exponents in 3.99 Dimensions”. Wilson used these methods to solve the 
“Kondo problem” in which an isolated spin, sitting in a bath of mobile electrons, ex- 
hibits asymptotic freedom, and he was among the first to understand the importance 
of numerical approaches to solve statistical and quantum field theories, pioneering the 
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subject of lattice gauge theory. In 1982 he was awarded the Nobel prize for his contri- 
butions to critical phenomena. 


3.6 Looking Forwards: Conformal Symmetry 


There are many questions that we have not yet answered? How do we know the critical 
exponents in d = 2 exactly? How do we know that there are an infinite number of fixed 
points in d = 2? Why are the critical exponents in d = 2 rational numbers while, in 
d = 3 they have no known closed form? How are we able to compute the d = 3 critical 
exponents to 5 significant figures? 


The answers to all these questions can be found in the emergence of a rich math- 
ematical structure at the critical point. As we’ve seen throughout these lectures, the 
basic story of RG ensures that physics at the critical point is invariant under scale 
transformations 


x + Ax (3.46) 


More surprising is the fact that the physics is invariant under a larger class of sym- 
metries, known as conformal transformations. ‘These transformation consist of any 
map 


oğ OXI 


x —> x(x) such that Jrk Əxi 


for some function ¢(x). Such conformal transformations have the property that they 
preserve angles between lines. 


The equation (3.47) has obvious solutions, such as translations and rotations, for 
which ¢(x) = 1. Furthermore, it is simple to see that the scaling (3.46) falls into the 
class of conformal transformations, with ¢(x) = A?. However, it turns out that there is 
one further, less intuitive transformation that obeys this condition. This is known as 
the special conformal transformation and is given by 

x’ —(x-x)a’ 


a er es Res oe) 


parameterised by an arbitrary vector a. 


The first question that we should ask is: why are theories at the fixed points invariant 
under the larger group of conformal transformations, rather than just scale transforma- 
tions? The answer to this, which goes somewhat beyond this course, involves a deeper 
understanding of the nature of the RG flows and hinges, crucially, on being able to 
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construct a quantity which decreases monotonically along the flow. This quantity is, 
unhelpfully, called c in d = 2 dimensions, f in d = 3 dimensions and a in d = 4 di- 
mensions, and the fact that it decreases monotonically is referred to as the c-theorem, 
f-theorem and a-theorem respectively. Using this machinery, it is then possible to 
prove that scale invariance implies conformal invariance. (The proof is more clearcut 
in d = 2; it relies on some extra assumptions in higher d, and there is a general feeling 
that there is more to understand here.) 


The existence of an extra symmetry (3.48) brings a newfound power to the study 
of fixed points. The d translational symmetries, sd(d — 1) rotational symmetries, d 
special conformal symmetries and single scale transformation combine to form the 
conformal group, which can be shown to be isomorphic to SO(d+ 1,1). All fields and, 
in particular, all correlation functions must fall into representations of this group, a fact 
which restricts their form. In recent years, our understanding of these representations 
has allowed new precision in the computation of critical exponents in d = 3 dimensions. 


This programme goes by the name of the conformal bootstrap. 


Conformal Symmetry in d = 2 


In d = 2 dimensions, conformal symmetry turns out to be particularly powerful. The 
group of finite conformal transformations follows the pattern in higher dimensions, and 
is SO(3,1) & SL(2,C). However, something rather special happens if you look at in- 
finitesimal transformations where one finds that many many more are allowed. In fact, 
there are an infinite number. This means that there is a powerful, infinite dimensional 
algebra, known as the Virasoro algebra, underlying conformal theories in d = 2 dimen- 
sions. This places much stronger constraints on these fixed points, ultimately rendering 
many of them solvable without resorting to perturbation theory. This is the reason why 
the critical exponents are rational numbers which can be computed exactly. This is 
also what allows us to understand the structure of the infinite number of multi-critical 
fixed points described in Section 3.5.2. 


Conformal field theory in d = 2 dimensions is a vast subject which arises in many 
different areas of physics. Although originally developed to understand critical phe- 
nomena, it also plays an important role in the lectures on the Quantum Hall Effect and 
the lectures on String Theory, where you can find an introduction to the basics of the 
subject. 
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4. Continuous Symmetries 


So far we have focussed almost exclusively on the Ising model. Now it is time to 
diversify. First, however, there is one more lesson to wring from Landau’s approach to 
phase transitions. . . 


4.1 The Importance of Symmetry 


Phases of matter are characterised by symmetry. More precisely, phases of matter are 
characterised by two symmetry groups. The first, which we will call G, is the symmetry 
enjoyed by the free energy of the system. The second, which we call H, is the symmetry 
of the ground state. 


This structure can be seen in the Ising model. When B = 0, the free energy has 
a G = Z symmetry. In the high temperature, disordered phase this symmetry is 
unbroken; here H = Zz also. In contrast, in the low temperature ordered phase, the 
symmetry is spontaneously broken as the system must choose one of two ground states; 
here H = Ø. The two different phases — ordered and disordered — are characterised by 
different choices for H. 


In the ordered phase we have two different ground states, whose phase diagram is 
reproduced on the next page. Whenever a discrete symmetry group like Zə is sponta- 
neously broken, it results in multiple ground states. One can move from one ground 
state to another by acting with the broken generators of G. 


In contrast, when B Æ 0 the free energy does not have a Zə symmetry, so G = Ø. 
According to Landau’s criterion, this means that there is only a single phase. Indeed, 
by going to temperatures T > T}, it is possible to move from any point in the phase 
diagram to any other point without passing through a phase transition, so there is no 
preferred way to carve the phase diagram into different regions. However, this also 
means that, by varying B at low temperatures T < Tę, we can have a first order phase 
transition between two states which actually lie in the same phase. This can also be 
understood on symmetry grounds because the first order transition does not occur at 
a generic point of the phase diagram, but instead only when G is enhanced to Zo. 


The discussion carries over identically to any system which lies in the Ising universal- 
ity class, including the liquid-gas system. This leaves us with the slightly disconcerting 
idea that a liquid and gas actually describe the same phase of matter. As with the 
Ising model, by taking a path through high pressures and temperatures one can always 
convert one smoothly into the other, which means that any attempt to label points in 
the phase diagram as “liquid” or “gas” will necessarily involve a degree of arbitrariness. 
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Figure 31: The phase diagram of the Figure 32: The phase diagram of the 
Ising model (again). liquid-gas system (again). 


It is really only possible to unambiguously distinguish a liquid from a gas when we sit 
on the line of first order phase transitions. Here there is an emergent G = Zə symme- 
try, which is spontaneously broken to H = Ø, and the two states of matter — liquid 
and gas — are two different ground states of the system. In everyday life, we sit much 
closer to the line of first order transitions than to the critical point, so feel comfortable 
extending this definition of “liquid” and “gas” into other regimes of the phase diagram, 
as shown in the figure. 


Beyond Ising 


The idea of symmetry, and of broken symmetry, turns out to be useful in characterising 
nearly all phases of matter. In each case, one should first determine an order parameter 
and a symmetry group G under which it transforms. Sometimes the choice of order 
parameter is obvious; sometimes it is more subtle. One then writes down the most 
general Landau-Ginzburg free energy, subject to the requirement that it is invariant 
under G. The different phases of matter within this class are characterised by the group 
H preserved by the ground state. 


There are a number of reasons why it is useful to characterise states of matter in 
terms of their (broken) symmetry. The original idea of Landau was that, as we’ve 
seen with the Ising model, symmetry provides a powerful mechanism to understand 
when a phase transition will take place. In particular, there must be a phase transition 
whenever H changes. 


However, it turns out that this is not the only thing symmetry is good for. As we 
will see below, knowledge of G and H is often sufficient to determine many of the low 
energy properties of a system, both through a result known as Goldstone’s theorem 
(that we will describe in Section 4.2) and through various topological considerations 
(some of which we will see in Section 4.4). 
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Finally, and particularly pertinent for this course is the role that symmetry plays in 
the renormalisation group and specifically in universality. One can ask: when do two 
systems lie in the same universality class? Although the full answer to this question is 
not yet understood, a fairly good guess is: when they share the same symmetry G. 


There are many different systems and choices of G that we could look at. A par- 
ticularly interesting class occurs when we take G = R? x SO(d), the group of spatial 
translations and rotations. The pattern of symmetry breaking provides, for example, a 
clean distinction between a liquid/gas and a solid, with the latter breaking G down to 
its crystal group. In this framework, there is not one solid phase of matter, but many, 
with each different crystal structure preserving a different H and hence representing 
a different phase of matter. The different breaking patterns of spatial rotations also 
allows us to define novel phases of matter, such as liquid crystals. Viewed in this way, 
even soap, which can undergo a discontinuous change to become slippery, constitutes 
a new phase of matter. We will not discuss this form of symmetry breaking in this 
course, but you can learn more about it in next term’s course on Soft Matter. 


Here, instead, we will be interested in phases of matter that are characterised by 
“internal” symmetry groups G that are continuous, as opposed to the discrete symmetry 
of the Ising model. This includes materials like magnets, where the spin is a vector 
that is free to rotate. It also includes more exotic quantum materials such as Bose- 
Einstein condensates, superfluids and superconductors. We will see that systems with 
continuous symmetry groups G exhibit a somewhat richer physics than we’ve seen in 
the Ising model. 


Beyond the Landau Classification 


The idea that phases of matter can be classified by (broken) symmetries has proven 
crucial in placing some order on the world around us. However, it is not the last 
word. Over the past twenty years, it has become increasingly clear that certain highly 
entangled quantum systems defy a simple characterisation by symmetry. The first, 
and most prominent, examples are the quantum Hall states. To understand these, one 
needs a new ingredient: topology. We will not touch upon this here, but you can read 
more in the lecture notes on the Quantum Hall Effect. 
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4.2 O(N) Models 


Phases of matter that are characterised by continuous, as opposed to discrete, symme- 
tries offer a rich array of new physics. The simplest such models contain N real scalar 
fields, which we arrange in a vector 


P(x) = (G1(X), a(x), .-., dv()) 
We will ask that the free energy is invariant under the O(N) symmetry 
a(x) > Ry'do(x) 
where R € O(N) so that RTR = 1. Now, when constructing the free energy we write 


down only the terms invariant under O(N). The first few are 


Flo] = fde [Zve Vot Fd 0+ 016-4)? 


where rotational invariance requires Vo - Vo = 0;6,0;¢q. These kind of theories are 
known, not unreasonably, as O(N) models. They are of interest for all N, but N = 2 
and N = 3 play particularly prominent roles. 


O(2): The XY-Model 


When N = 2, it is often convenient to pair the two real scalar fields into a single 
complex field 


P(x) = d(x) + tb2(x) 


The free energy now consists of all terms which are invariant under U(1) phase rotations, 
ap — e°. The first few terms are 


2 
Voie 
Fivea|= far [Zvw vus Siu? +alult +... (4.1) 
This is also known as the XY-model or, sometimes, the rotor model. 


There are at least two physical systems which sit in this universality class. The first 
are magnets where, in contrast to the Ising model, each atom has a continuous spin s 
which can rotate in a plane. (This is usually taken to be the X — Y plane, which is 
where the name comes from.) The microscopic Hamiltonian is the generalisation of the 
Ising model (1.1) to 


E=-JX si-s; (4.2) 
(ij) 
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where |s;| = 1. This is also written as 


E= -JX cos(6; — 0;) 
(ij) 
where 6; is the angle that the spin s; makes with some, fixed reference direction. Coarse- 
graining this microscopic model gives rise to the free energy (4.1). One could also add 
a magnetic field term }°,B-s;, where B is also a two-component vector. Such a term 
would break the O(2) symmetry, and introduce terms in (4.1) that are odd in w. 


The second physical system described by (4.1) is rather different in nature: it is a 
Bose-Einstein condensate, or its strongly coupled counterpart, a superfluid. Here, the 
origin of the order parameter w is rather more subtle, and is related to off-diagonal 
long-range order in the one-particle density matrix. In this case, the saddle point of 
the free-energy leads to the equation of motion 


YV? = wp + 4glp|p +... 
which is known as the Gross-Pitaevskii equation. 


It is sometimes, rather lazily, said that w(x) can be thought of as the macroscopic 
wavefunction of the system, and the Gross-Pitaevskii equation is then referred to as a 
non-linear Schrodinger equation. This is misleading for the simple reason that quantum 
mechanics is always linear. 


O(3): The Heisenberg Model 


The case N = 3 also describes magnets. The microscopic energy again takes the 
form (4.2), but now where each s; is free to rotate in three dimensions. This is referred 
to as the O(3) model or, alternatively, as the Heisenberg model. 


4.2.1 Goldstone Bosons 


The real novelty of continuous symmetries arises in the ordered phase, where ag < 0 
and, correspondingly, (@) # 0 in the ground state. For the Ising model, we had two 
possible choices: m = +m. The system had to pick one, and in doing so spontaneously 
broke the Zə symmetry. With a continuous symmetry, we have an infinite number of 
choices. The minimum of the free energy constrains only the magnitude of @ which is 
given by 


($) = Mo = 4/-F 


= 01 = 


However, minimising the free energy does not determine the direction of @. We are 
left with a space of ground states which is the sphere S-t. Each point on the sphere, 
parameterises the direction of @ and has the same energy. The configuration in which 
all the spins point like this: 


has the same energy as the configuration in which all the spins point like this: 


This infinitely degenerate choice of ground states gives us something new. We can 
consider configurations in which we stay within the space of ground states, but the 
direction varies in space. For such configurations, the part of the free energy f(@) = 
i |p|? + g|@|4+... remains minimised, but we pick up contributions from the gradient 
terms |V@|?. However, we can always lower this free energy by making the variation 
take place over longer and longer distances. The upshot is that there are excitations of 
the system that look like this 
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which can be made to cost an arbitrarily small amount of energy, by stretching the 
winding over longer and longer distance scales. 


These kind of excitations, which arise from the spontaneous breaking of continuous 
symmetries, are known as Goldstone bosons, or sometimes Nambu-Goldstone bosons. 
In the particular context of magnets, they are called spin waves. 


There is a dizzying array of names for these kind of excitations, reflecting their 
ubiquity and importance. In general, an excitation whose energy cost vanishes as the 
wavelength goes to infinity is referred to as a soft mode or, alternatively, is said to 
be gapless. These are to be contrasted with gapped excitations whose energy remains 
finite in this limit. In the context of quantum field theory, “gapless” = “massless”, and 
“capped” = massive”, with the energy gap coming from E = me. 


Gapless excitations often dominate the low-temperature behaviour of a system, where 
they are the only excitations that are not Boltzmann suppressed. In many systems, 
these gapless modes arise from the breaking of some symmetry. A particularly impor- 
tant example, that we will not discuss in these lectures, are phonons in a solid. These 
can be thought of as Goldstone bosons for broken translational symmetry. 


The Symmetry Behind Goldstone Bosons 


The intuitive idea described above can be placed on more rigorous footing in the form 
of Goldstone’s theorem. This states that, in any system the spontaneous breaking of 
a continuous symmetry gives rise to a gapless excitation, the eponymous Goldstone 
boson. This can be stated in language of group theory. 


For our O(N) model, the G = O(N) symmetry is broken by a choice of (@) to 
H = O(N —1). (To see this, note that if @ = (Mo, 0,...,0) then there is a surviving 
O(N — 1) symmetry which acts on the on the string of zeros.) The space of ground 
states has a group theoretic interpretation as the coset space 

gv-1 _ OWN) 

O(N — 1) 

This idea generalises. If a continuous symmetry G is spontaneously broken to H, then 
the manifold of ground states is given by G/H. We get a Goldstone boson for each 
broken symmetry generator, so the total number is 


# Goldstone Bosons = dim G — dim H 


For the O(N) model, G = O(N) and H = O(N —1) so the number of Goldstone modes 
is then $N(N — 1) — 3(N — 1)(N — 2) = N — 1, which is indeed the dimension of the 


sphere St, 
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Figure 33: The so-called ” Mexican hat” potential for the XY-model in the ordered phase. 


An Example: the XY-Model 


It’s simple to write some equations to go with the pictures above. Let’s start with the 
XY-model. In the ordered phase, we get a so-called Mexican hat potential, as shown 
in the figure. We can see that there is a circle, St of minima. It’s useful to decompose 
the field as y(x) = M(x)e®™. In the ground state M = My = ,/—p?/4g, while 0 is 
arbitrary. If we write 


M(x) = Mo + M(x) (4.3) 
Then the free energy has the expansion 
F(M, 6] = f ae LVM)? + UN? + git +... 


4 T MÈ(VO) + yMoM(V0)? +... (4.4) 


Here, the Goldstone boson is 0(x). There can be no terms of the form 6? or 64 arising 
in the free energy. Instead, it has only derivative interactions. 


Another Example: The Heisenberg Model 


For the O(3) model, we decompose the field in spherical polar coordinates, 


o = M (sin 0 cos ¢, sin 6 sin ¢, cos 0) 


— 104- 


with 6 € [0,7) and ¢ € [0, 27). Once again, in the ordered phase we have M = Mo # 0, 
with 6 and ¢ arbitrary. Expanding M as (4.3), the free energy now takes the form 


F[M, 0, ¢] = fas (VM) + WIMP + gM +... 


a J Mj [(V0)? + sin? 6 (V¢)?] +... (4.5) 
Here 6 and ¢ are the two Goldstone modes and, correspondingly, have only derivative 
interactions. Note, however, that this time the Goldstone modes interact with each 
other, as seen in the sin? 6 (V)? term. 


The kinetic terms for the Goldstone bosons above take the form of the metric on 
the two-sphere S?, i.e. ds? = dé? + sin? 6d¢?. This is no coincidence: the Goldstone 
bosons describe fluctuations around the minima of the free energy F'[@]. In the present 
case, this set of minima is S?, and this geometry gets imprinted on the dynamics of the 
Goldstone modes. We will explore this more in Section 4.3. 


Correlation Functions 


We saw in Section 2.2 that the quadratic term in the free energy is related (inversely) to 
the correlation length €. For Goldstone bosons this quadratic term necessarily vanishes 
and so they have infinite correlation length. 


This manifests itself in the correlation function, which decays as a power-law rather 
than exponential. This is simplest to see in the XY-model. (We will discuss O(N) 
models with N > 3 in more detail in Section 4.3.) The free energy (4.4) is 


F|] = jes SMEO? +... 
where the higher order terms are all derivatives and will not affect the discussion below. 
To compute the correlation function (0(x)@(y)), we can simply import the result (2.20). 
(There are some subtleties in doing the path integral because 6(x) is periodic, now 
valued in [0, 27) rather than R. These subtleties turn out not to be important here 
but we will revisit them in Section 4.4.) The long distance behaviour is 


dk eik-(x-y) 
(0) = Sa | GE (4.6) 


This is similar to the behaviour of the correlation function at the critical point. In- 
deed, a critical point can be thought of as having gapless excitations. But there are 
differences. 
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First, the power-law decay at critical point requires some fine-tuning of a parameter; 
we must pick the temperature to be exactly T = T}. In contrast, spontaneous symmetry 
breaking is more robust, and we get power-law decay for all T < Te. In other words, 
we have a phase with long range correlations, rather than just a point in the phase 
diagram. (For T > Te, where there is no symmetry breaking, all modes still decay 
exponentially as in the Ising model.) 


The second difference is that Goldstone bosons are much simpler to understand 
than the gapless modes at a critical point. As we have seen, at critical points the 
power-law decay of correlation functions suffers a correction due to integrating out 
short distance modes, resulting in the critical exponent 7 # 0. There are no such 
subtleties for Goldstone modes since all the dynamics is constrained by symmetry, and 
the correlation function (4.6). There are two caveats to this statement, both of which 
we will elaborate upon below. The first is that the simplicity only holds when T < Te; 
when we sit at the critical point T = Tẹ, things become interesting once again. The 
second caveat is that we have to be above the lower critical dimension for the Goldstone 
bosons to exist. 


4.2.2 The d = 4 — e Expansion 


At the critical temperature, T = T,, the O(N) models exhibit critical behaviour. The 
mean field approach to the O(N) model gives the same answer as we saw for the N = 1 
Ising field theory in previous sections. By now, you will not be surprised to learn that 
these mean field exponents are not always correct. However, the system now does not 
flow to the Ising critical point. Instead, they lie in a different universality class. 


First, in d = 2 there are no critical points with G = O(N) symmetry. We’ll see why 
in Section 4.2.3 and explore the physics more in Sections 4.3 and 4.4. 


In d = 3, the theories flow to a different critical point for each N. The critical 
exponents are known to be: 


MF 0 
Ising | 0.0363 0.6300 
N =2 | 0.0385 0.6719 
N = 3 | 0.0386 0.702 


NIF| v 


where the other critical exponents, a, 3, y and 6 all follow from the scaling relations 
that we saw in Section 3.2. 
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Figure 34: The heat capacity of helium at the superfluid transition. This system lies in the 
XY universality class. The data above is well described by the function C ~ Co + A|T — Te% 
with a ~ —0.16 and A < 0. 


While the values of 7 and v do not look very different from the Ising exponents, there 
is an important difference in the critical exponent for the heat capacity c ~ |T — T,|~°, 
which is given by a = 2 — 3v. For both the O(2) and O(3) transition, a is negative. 
For example, a ~ —0.16 for the O(2) transition. This means that the heat capacity 
exhibits a cusp, rather than a true divergence. 


For example, the superfluid transition of helium lies in the XY universality class. 
The heat capacity has long been known to exhibit cusp-like behaviour as shown in 
Figure 34°. This characteristic shape means that the second order superfluid transition 
is sometimes referred to as the “lambda transition”. It turns out that the accuracy 
in these experiments is limited by the effect of the Earth’s gravitational field. In 
the 1990s, these measurements were made on a space shuttle flight, in broad (but 
not perfect) agreement with theoretical prediction of c ~ Ax — Bt~® for the critical 


exponent œ ~ —0.16 and suitable coefficients A+ and B. 


The transition to Bose-Einstein condensate also sits in the XY universality class. 
This is a particularly clean system which allows precision experiments. For example, 
the data above shows the behaviour of the correlation length as a gas of ultracold 
rubidium-87 atoms passes through the critical point. The critical exponent is found 


°This data is taken from Buckingham and Fairbank, “The Nature of the Lambda Transition”, in 
Progress in Low Temperature Physics III, 1961. 
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Figure 35: The correlation length for a BEC at the critical point. This system lies in the 
XY universality class and has v = 0.67 + 0.13. 


to be v = 0.67 + 0.13, in good agreement with the theoretical prediction (albeit with 


fairly large error bars)’. 


It is not too difficult to repeat the RG calculations that we did in Sections 3.3 - 3.5 
for the O(N) model. As before, we rescale fields so that our order parameter — which 
we now call ¢,(x), with a=1,..., N — has free energy 


1 1 
BF[ġ] = jes Bg Va + 5 Ho gaba + go(paba)? +... 
The study of the Gaussian fixed point, at u4 = go = 0, goes through much as before. 
Indeed, a simple dimensional analysis argument tells us that 


d—2 
[a] = Ag = —— 


and, so 
13] =2 and [go] =4—d 


so that ug is always a relevant deformation, while go is relevant in d < 4 dimensions. 
So far, nothing depends on N. 


10This data is taken from the paper “Critical behaviour of a trapped interacting Bose gas” by T. 
Donner et. al., arXiv:0704.1439. 
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The differences arise in perturbation theory. The part of the free energy which mixes 
long and short wavelength modes is 


BFi{d] = f dit g(baba)? 


The presence of the internal indices, a = 1,..., N, means that the interaction has more 
structure than previously. To reflect this, we need to change our rules for drawing 
diagrams. First, each line should now be accompanied by an internal index a. Second, 
it is useful to split the interaction vertex as 


~ Go SabSed 


where the red ellipse splits the four legs into two pairs, each of which is a singlet under 
the O(N) symmetry, as shown in the delta function structure. (You may have to squint 
in some of the following pictures to see which pairs of legs are contracted.) 


Order go 


We can now run through our previous calculation to see how things change when we 
have N fields. At order gg, we previously found just a single diagram which renormalised 
uw”. Now the index structure means that splits into two different contributions. The 
first is: 


ene 4 
7 - d*k; _ _ 
i = 200 | T] (2r)? Par Paks X (Poka Phka) 4 X (27)457(S ki) 
ae i=1 i 


The other contribution has a different contraction between internal indices 
—_ = diki dsd 
i i p— pe + a+ 
| = go f II Qn Parr Pour X (Oates Pires), X Cr) O k) 
saa” i=1 7 


Note that the overall coefficients are 2 + 4 = 6, which agrees with our earlier counting 
(3.28). Each of these gives the same result we saw for a single scalar field, but with an 
important difference: the first diagram has an extra factor of N, arising from the fact 
that any of the N species can run in the loop. This is general result: any closed dotted 
loop gives an extra factor of N. 
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The rest of the calculation proceeds as in Section 3.4.1. We find that, at this order, 
we have a renormalisation of the quadratic term given by 


2 12 2 i d'q 1 
Ho >H =p +4(N +2) f = 
° i aye (2m)4 q? + u 
This agrees with our earlier result (3.30) when N = 1. 
Order gê 


A similar thing happens at the next order, with the single diagram at N = 1 splitting 
into three different diagrams, 


SOK KK 


Only the first of these has a closed loop, unattached to the external legs. This comes 
with a factor of N, while the other two do not. A careful computation of the relevant 
overall factors shows that these diagrams renormalise the quartic coupling as 


90 > 9 =9 Nta f a : 
0 = Jo = g 
j ° Jape (27)° (@ + ph)? 


Again, this reproduces our earlier result (3.40) when N = 1. 


The Epsilon Expansion 


We learn that the general structure of RG flow around the Gaussian fixed point is es- 
sentially the same as in the Ising field theory discussed in Section 3; only the coefficients 
of the beta functions differ. The same structure is also seen in the epsilon expansion. 
Working in dimension 


d=4-e 


the beta function equations become, at leading order in € and g, 


where, as in our earlier discussion, we’ve introduced the dimensionless coupling g = 
A~‘g. The analog of the Wilson-Fisher fixed point sits at 
s tava Qn? 


zan d = 
Mmana A ENa 


€ 
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Around this fixed point, the linearised beta functions take the form 


d (op? \ -Nte C ôu? 
ds \ 6g E 0 —€ og 


where the off-diagonal entry is C = S424? + os Ae. This does not affect the 


eigenvalues which are given by the diagonal entries, 


N+2 


A=2— 
i N+8 


e+O(e?) and A, = —€+O(e’) 


The interacting fixed point has one relevant and one irrelevant direction, just like for 
the Ising model. To leading order in €, the critical exponents are 


4—N 3 1 3 ia N+2 | 
Qa = — —E = — — — A = — 
O(N +8)’ 2 2N+8)’ 7 2(N +8) 
and 
N+2 
“tage 4.7 
a N+8° (en) 


Meanwhile 6 = 3 + € is independent of N, and the anomalous dimension turns out to 
be n = (N + 2)e?/2(N + 8) 


4.2.3 There Are No Goldstone Bosons in d = 2 


We learned in Section 1 that field theories have a lower critical dimension, in which 
the ordered phase ceases to exist. For theories characterised by any discrete symmetry, 
such as the Zə of the Ising field theory, this lower critical dimension is d; = 1. As we 
explained in (1.3.3), the lack of an ordered phase in d = 1 dimensions can be traced to 
the existence of domain walls. 


The story is different when we have continuous symmetries. Now there are no domain 
walls because the space of ground states is continuously connected. However, there is 
a more prominent phenomena which means that the lower critical dimension is raised 
to dy = 2: 


An Example: the XY model 


The lack of ordered phase in d = 2 dimensions arises due to the presence of the would- 
be Goldstone modes. This is simplest to explain in the XY model. Let’s sit in the 
broken phase and focus only on the Goldstone bosons. We must pick a ground state 
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for 0: this is the essence of spontaneous symmetry breaking. Let’s choose (6(x)) = 0. 
We now look at fluctuations around this ground state, 


([0(x) — 8(0)]?) = 2(4(x)?) — 2(0()0(0)) (4.8) 
From the correlation function (4.6), the long distance behaviour is 


Apt ee 


A d e7ikx 
(0(x)0(0)) = 7 ie f oy a) log(Ar) d=2 (4.9) 
r— A7! d= 


with r = |x]. 


We see that there is a qualitative difference between d > 2 and d < 2. For d > 2, 
the two point correlator (0(x)0(0)) decays to a constant as r — oo. This constant is 
cancelled by the other term in (4.8), which means that the phase returns to its original 
value (0) = 0. 


In contrast, for d < 2, the fluctuation of the phase grows indefinitely as we go to 
larger distances. You may have thought that you had placed the system in a fixed 
ground state, but the thermal fluctuations of the Goldstone mode mean that it doesn’t 
stay there. The interpretation is that there is no ordered phase with (@) # 0 in d = 2 
dimensions or below. 


This is a general result, known as the Mermin-Wagner theorem. A continuous sym- 
metry cannot be spontaneously broken in d = 2 dimensions or below: there are no 
Goldstone bosons in d = 2 dimensions. 


This leaves us with a delicate question. The existence of the gapless Goldstone 
modes was predicated on the idea of spontaneous symmetry breaking. But for d < 2 
dimensions, no such symmetry breaking happens. What, then, is the resulting physics? 


In d = 1, the physics is straightforward: there are no gapless modes. As before, this 
can also be understood in the language of quantum mechanics, where the spectrum 
of a particle moving on S‘~! is discrete and gapped. For d = 2, the physics is more 
interesting. It turns out that the answer is somewhat different for O(N) models with 
N > 3 and for the XY-model with N = 2. We will discuss the fate of the Goldstone 
modes for each of these in Sections 4.3 and 4.4 respectively. 
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4.3 Sigma Models 


We have still to understand the fate of the Goldstone bosons in d = 2 dimensions. In 
this section, we will tell their story. As a spin-off, we will see that we also a get a new 
handle on the critical point in d = 3 dimensions. 


We place ourselves firmly in the ordered phase, with T < Te. Mean field consider- 
ations tell us that (|@|) # 0, leaving us with a space of possible ground states which 
is identified with the sphere St. As we saw in Section 4.2.1, fluctuations in the 
directions parallel to the S-t have only power-law decay; these are the Goldstone 
modes. In contrast the “longitudinal” fluctuation, in which d@ ~ (@), acts very much 
like in the Ising model and has exponential decay with a correlation length € 4 0. This 
suggests that the long distance dynamics is dominated purely by the Goldstone modes. 


Here we will study the theory of these Goldstone modes. First, rather than working 
with @- @ = Mẹ, we rescale the field @(x) to a new field, n(x) which has unit length, 


n-n=1 (4.10) 


For now, we will keep the dimension d arbitrary. The free energy is given by 
1 
= d 


where the coefficient e? = 1/yM@ is the price that we pay for rescaling n to be a unit 
vector. The free energy (4.11) looks like that of a free theory; all the interactions come 


from the constraint (4.10) which say that the fields n(x) must lie on the unit sphere 
ora 


The theory defined by (4.11), together with the constraint (4.10), lies in a class 
of theories referred to as non-linear sigma models. These are theories in which the 


fields can be viewed as coordinates on some manifold M. In the present context, this 
manifold is M = S7., 


We would like to understand the path integral for the sigma model. Schematically, 
this can be written as 


J= [om 6(n(x)? — 1) exp (= jets Vn: vn) (4.12) 


Here we’ve imposed the constraint through a delta function. Note that the only coef- 
ficient in the game is e°; this will play the role of our coupling constant. Recall that, 
long ago, before we grew up and set 6 = 1, we used to write the partition function 
as e7". Comparing to this form suggests that e? can be viewed as temperature, with 
large e? corresponding to high temperature. This interpretation will be useful later. 
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We can do some simple dimensional analysis. The field n(x) must be dimensionless 
since it obeys the constraint (4.10). So, 


[e?] =2-d (4.13) 


In particular, e? is dimensionless in d = 2. Here the theory is weakly coupled when 
e? < 1, in the sense that field configurations n(x) with wild spatial variations are 
suppressed in the path integral. Before our rescaling, this corresponds to the case where 
@ parameterise a large S-t sphere. In contrast, when e? >> 1 these configurations 
are unsuppressed and the theory is strongly coupled; in this case the @ parameterise a 
small sphere. 


It is possible to write the sigma model in a more explicit form. We can decompose 
the vector n as 


n(x) = (7(x), 0(x)) 
where 7(x) is an (N — 1)-dimensional vector and a(x) is given by 
a(x)? = 1—#(x) - #(x) 


which ensures that the fields sit on the ground state manifold n-n = 1. The free energy 
is then given by 


1 
= d 3 = 
Fr] = fa ie 5a IVa Va t+ VoVo] 


= U)2 

= jes [vr v+ -o (4.14) 
This form of the sigma model does not have any constraint; it is an interacting theory of 
the Goldstone modes 7 (x). However, we have paid a price: only an O(N — 1) symmetry 
is now manifest in the free energy, rather than the full O(N) symmetry. This is because 
we have had to make a choice of which of the redundant n variables to eliminate in 
order to solve the constraint (4.10). Related to this, our free energy (4.14) is now only 
valid as long as o(x) # 0 anywhere, in which case the second term would diverge. 


SN-1 


This is because the 7 fields are coordinates on the space and it is impossible to 


introduce coordinates which are well behaved over the entire manifold. 


As an aside: the name “sigma model” is, obviously, completely uninformative. It 
has its roots in particle physics where a theory of this type describes the interactions 
of pions. Strangely, the eponymous “sigma” meson is the one particle not described by 
the sigma-model; it is analogous to the longitudinal mode o(x) which is determined in 
terms of the 7 fields in our description above. 
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4.3.1 The Background Field Method 


We would like to perform a renormalisation group analysis on the sigma model. There 
are a number of ways to proceed. First, we could Taylor expand the second term in the 
free energy for small 7. This would result in an infinite tower of interactions. We could 
then restrict attention to the first few, and do the kind of Wilsonian RG treatment 
we've seen before. This method works, but it butchers the underlying geometry and, 
in doing so, disguises what’s really going on. 


Instead, there is a better approach, first introduced in this context by Polyakov, 
called the background field method. First, suppose that n(x) takes some profile which 
varies slowly in space, 


This profile must obey ñ- ñ = 1. This will play the role of our long wavelength modes. 


On top of this background, we want to introduce short wavelength modes which 
change rapidly in space. To parameterise these modes, we first introduce frame fields. 
These are a basis of N — 1 unit vectors e*(x), witha =1,...,N and a = 1,...N — 1, 
which are orthogonal to n(x), 

n'(xje§(x)=0 Va and ef(x)e3(x) = dag (4.15) 
The frame fields are, like n°, slowly varying in space. There is an ambiguity in the 


definition of these frame fields; we can always rotate them by a local O(N — 1) trans- 
formation and we will still have a good set of frame fields. 


The short wavelength modes sit on top of our original field n(x) and fluctuate in the 
direction of the frame fields. We call these y,(x), and write the full configuration as 


n(x) = A(x) (L = x(x)?)"? + 5 Xa(x)ea(x) (4.16) 


By construction, this configuration still satisfies the constraint (4.10). This is morally 
equivalent to our previous Fourier space decomposition @ = @_ + ¢,, but now in real 
space. We will integrate out the short wavelength modes y and determine their effect 
on the long wavelength mode n. 
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Integrating out Short Wavelengths 


We have a short calculation ahead of us. Our plan is to expand the free energy to 
quadratic order in the short wavelength fields y,(x) and then integrate them out, in 
exactly the same way that we integrated out the Fourier modes ¢, previously. We will 
then interpret this in terms of an effective free energy for the long wavelength fields n* 
and, in particular, in terms of a renormalisation of the coupling e°. 


First, we have 
Vnt = Vai? (1 — x7)? + AV (1 — x7)? + V (xaet) 
SNL De) -Ñ XaV Xa + V (Xala) + O) 
The gradient term then becomes 
(Vn)? = (VA7)?(1 — x7) + (Vxa)* + Xaxs Vea Veg + 2(VXa)x pes Veg 
+ 2VATV (Xact) + O(x?) 


where we have used the identities (4.15). One of the cross-terms vanishes by dint of 
the fact that n°n? = 1 so that n°Vn? = 0. 


Our partition function is now 
Z= fon 6(m? — 1) en nen S davi)? [x en za S dax)? .—Frlfx] 
where the interaction between ñ^° and Xa are captured in 


~a 1 ~ a a a 

Frnt’, Xa] = aa f ate | =y (VA)? + XaXpV ea V Eg 
+ 2(VXa) xek Veg + 2V" V (xaek)| 

As previously, we interpret the functional integral over y as computing the expectation 

value of e~*! using the probability distribution exp(—54 f dfx (Vx)?). In other words, 


Be 
Z= fon He — 1) en zz J Pa(vay? (eF 


The expectation value can be Taylor expanded, 
z 1 
(edy = 1 — (Fr) + o +... (4.17) 


As usual, the renormalisation group will generate many terms when we integrate out 
x. Our interest is in how the leading order kinetic terms (V7)? are affected; all other 
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terms will turn out to be irrelevant. At leading order, it is sufficient to focus on (Fr). 
(Given the term linear in y, one might wonder if such a term can be generated from 
(F?): a closer inspection shows that this in fact gives rise only to terms like (V7)*.) 
We have 


(Fr) = aa f ae (—bae(Vin*)? + Ve&Ves) (xa(x)xXa(x)) (4.18) 


where we’ve used the fact that (x(x)) = 0 to lose the linear term and, on rotational 
grounds, ((Vxa)Xg) = 0 to lose another term. The correlator that we want takes the 
same form that we calculated in previous sections. (See, for example, (2.21).) 


where 


Here we’ve introduced the limits on the integral to reflect the fact that, as in our earlier 
RG analysis, the short wavelength modes — which are here y,(x) — have support only 
in A/¢ < k < A. The integral is simple to perform; we have 


¢-1 d=1 


A? x < logt d=2 
1-¢-4 d>3 


Qaa 
(27)? 


Ia 


where Qg—ı is the area of the unit sphere S¢!. Substituting this into (4.18), it is 
clear that the first term corrects the kinetic term in the sigma model. But what of the 
second term? Using the fact that the correlator is proportional to dag, it takes the form 
Veo Ves. We can massage this into the form we need using some identities. Between 
then, the fields (#7, e£) provide an orthonormal basis of R^. Inverting this, we have 


non? + ete? = 5” (4.19) 
Using this, we can write 
Vet Vet = VeVe (arn? + ezen) 
But since A'e% = 0, we have A Veg = —(Vn"7)e% so 


a a 
Veg Veg 


ete? (VA) (VAP) + (egVer)(eVer) 
= VAVA + (eg Vet) (e5Ver) 


a 


= iit 


where, in going to the second line, we’ve used (4.19) again, together with the fact that 
NIVA = 0. The first term is just what we want; the second term is a new term that we 
can add to the sigma model and will be generated by RG. It is related to the geometric 
concept of torsion; it turns out to be irrelevant and we will not discuss it further here. 


Both terms in (4.18) now give a contribution to (Vn%)?; the first is —d,360 = 
—(N — 1); the second is simply +1. The upshot of this is that the (Fr) includes the 
term 


(F) = (2-—N)Ig I d'z Evae) 


We now exponentiate this so that, to the order we’re working at, (4.17) becomes 
(e71) = e-\"1)_ This gives us an effective free energy for the long wavelength field 


n, 
~ a l1 o 7 
Fin] = | d'z pera Vn Vin 
with 
1 1 
— = 5+(2-N)I 
e!2 e2 + ( ) d 


Usually there are two further steps in the RG programme. First, we need to rescale 
our momentum cut-off back up to A, and in doing so rescale all length by 1/¢. This 
proceeds as before. The second step, advertised in Section 3, is to rescale the fields 
so that the kinetic term is canonically normalised. This step is not for us, since the 
normalisation of the kinetic term is the only coupling we have. Instead, the fields are 
normalised correctly by imposing the constraint (4.10). The upshot is that we have the 
running coupling constant 


1 1 
—— = ae (J +(2-N ta) 4.20 
HOS (gte i 
The first term comes from the naive dimensional analysis [e°] = 2 — d that we saw 


in (4.13); the second term is the one-loop correction from integrating out the high 
momentum modes. 


Note that this second term vanishes when N = 2. This reflects the fact that the 
Goldstone boson in the XY-model is free; this can be seen in (4.4) where there are no 
interaction terms. Interesting things happen in the XY-model but we will postpone 
discussion to Section 4.4. In contrast, for N > 3 the Goldstone bosons are interacting, 
as can be seen for the O(3) model in (4.5), and this drives the running of the coupling 
constant. 
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Figure 36: The beta function for e takes us away from the ordered phase at e = 0 when 
d < 2 and towards the ordered phase when d > 2. For d = 2 + e€, there is an unstable, UV 
fixed point. 


4.3.2 Asymptotic Freedom and the d = 2 + € Expansion 


Let’s look more closely at the running coupling (4.20). To start, consider what happens 
in d = 2 dimensions. As previously, we write 


A 
a= he? 
Ç 
and compute the beta function 
de e 
= — = (N — 2)— 
Bg) = = (N-22 


The beta function vanishes for N = 2. This is because, as we mentioned above, the 
Goldstone mode in the XY model is non-interacting. However, for N > 3 the beta 
function is positive. It means that e? is an example of a marginally relevant coupling: 
as we flow the infra-red, e? gets larger and the theory becomes strongly coupled. Cor- 
respondingly, the theory is weakly coupled in the ultra-violet. This property is known 
as asymptotic freedom, which refers to the fact that the theory is free at asymptotically 
high energies. 


Asymptotically free theories are rather rare in physics. Perhaps the best known 
example is QCD or, in general, Yang-Mills theory with some small number of matter 
fields. 


The sign of the beta function is telling us that, in d = 2 (and indeed in d = 1), 
the weakly coupled ordered phase that we started with is unstable. This is a man- 
ifestation of the Mermin-Wagner theorem that we mentioned in Section 4.2.3; there 
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are no Goldstone bosons in d < 2. Unfortunately to really understand the infra-red 
physics in these low dimensions we will have to figure out how to deal with the strongly 
interacting theory. We will introduce a particularly useful approach in Section 4.3.3. 


In higher dimensions, say d = 3, the beta function is negative. This means that the 
sigma-model flows towards weak coupling in the infra-red, telling us that the ordered 
phase is stable. 


However, there is now something new we can do. We can look at what happens in 
dimension 


d=2+e 


Here the beta function takes the form 


de € e? 
OS (a De 
ds g ) An 


This has a fixed point that lies within the remit of perturbation theory, namely 


2_ QTE 
* N-2 


e A" (4.21) 
However, in contrast to the story of Section 4.2, this is now a UV fixed-point, rather 
than an IR fixed point. How should we interpret this? 


To understand this, let’s recap the story so far. The O(N) model, with unconstrained 
fields @, has a Wilson-Fisher fixed point in dimension d = 3. This has one relevant 
deformation which is, roughly speaking, °. If we turn on this relevant deformation 
with negative sign, we flow to the ordered phase which is described by our sigma model. 


What we’ve seen above is this story in reverse. Starting from the ordered phase, 
described by the sigma model, we have managed to claw our way back up the RG flow 
to find a UV fixed point, at least in dimensions d = 2 + e. It is natural to identify 
this with the Wilson-Fisher point, viewed through different eyes. This provides us with 
a different handle on the O(N) Wilson-Fisher fixed points for N > 3; we can either 
approach them from above using the d = 4 — e expansion, or from below using the 
d = 2 + € expansion. 


To extract the critical exponent v, we need to understand how e? is related to the 
temperature. From our definition of the path integral (4.12), we see that 1/e? sits in 
the exponent where 3 would sit in a usual partition function. This motivates us to 
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identify e? with temperature T and the fixed point e? with the critical temperature T,. 
We then linearise about the fixed point by writing e? = e? + de? to find 


d(de? 
Ce = +e de” 
ds 
This gives A; = € and, correspondingly, the critical exponent 
1 
v= -= (4.22) 


€ 
independent of N. To compute the critical exponent 7, one could add the interac- 
tion fd’x B - n(x) or, alternatively, extract the anomalous dimension of n from the 
calculation above. One finds that 

-E 

 N-2 


The d = 2 + € expansion does not give great results if we just go ahead and plug in 


n 


€ = 1. But then, there is little reason that it should! For example, for N = 3, we can 
compare the best known results with mean field, and with the d = 4 — € and d = 2 + € 
expansions, where we work to first order and plug in € = 1. We have 


n V 

MF 0 5 

d=4-e/ 0 061 
d=2+€ 1 

Actual | 0.0386 0.702 


Nonetheless, there is some utility in having two expansion parameters, coming from 
different ends. By going to higher powers in €, one can try to use sophisticated matching 
techniques to join together the two expansions and get a better handle on the values 
of the critical exponents for d = 3. 


4.3.3 Large N 


So far, we have seen that the dynamics of interacting Goldstone modes (i.e. N > 3) 
becomes strongly coupled in d = 2 dimensions. But we have yet to figure out what 
actually happens. 


Questions like this are typically hard. As e? — oo, it naively appears that all field 
configurations contribute equally to the path integral, no matter how wildly they vary 
and how far they are from the saddle point. We have very few techniques to deal with 
such situations. Often we have to turn to some hidden and surprising symmetry, or to 
some unusual limit where the theory is soluble. 


= T21- 


In the present case, it turns out that such a limit exists: it is M — oo. To proceed, 
we first rewrite the delta-function in the path integral (4.12) as 


1 
= | DaDo exp -z | ae y e d’x o (n-n— 1) (4.23) 
2€5 2€5 
Here the field o(x) plays the role of a Lagrange multiplier; integrating it out gives us 
back the delta-function, imposing the field constraint n? = 1. 


Now, however, we’re left with a free energy which is quadratic in the n. Instead of 
integrating out o, we can instead integrate out n. This gives us 


Z= fo det 70 (—V? + io(x)) exp (fe o) 
0 


Here the determinant of the differential operator should be viewed, in the usual way, 
as the product of all its eigenvalues, with a truncation associated to the UV cut-off 
A, reflecting the fact that the eigenfunctions can’t oscillate at high frequencies. This 
determinant will, in general, be a complicated function of a, and it does not look as if 
we are any closer to evaluating the path integral. We can, however, use the standard 
“log det = tr log” identity to write the partition function as 


A= |v exp (-Fe log (—V? + io) + 5a jez o) (4.24) 
0 


The factor of N in front of the first term is what gives us hope because, in the limit 
N —> ov, this term is then crying out to be evaluated by saddle point. However, we’re 
still left with the second term. We can only apply the saddle point technique to this too 
if we scale the coupling e? with N in a particular way. Specifically, we send N — on, 
keeping e3N fixed. 


The path integral (4.24) is then dominated by the minimum. We use the identity 
dtrlog X = tr X~'6dX, to find that the saddle point is 


1 


N 
—G(x,x) = T 


3 (4.25) 


where G(x, x’) is the Green’s function for the operator (—V? + io (x)). This equation 


looks somewhat foreboding, but is rather simple in Fourier space. First, we look for 
constant solutions, of the form 


= 22:5 


Note the factor of i; our saddle point sits on the complex plane, but is nonetheless still 
applicable. The saddle point (4.25) then becomes simpler in Fourier space: we have 


A gd 

d°k 1 1 

f PE. (4.26) 
(2r) k? +p? eN 

where we’ve explicitly included the UV cut-off A in the integral. This equation should 


now be viewed as an equation for p/?. 


Large N in d=2 


It is perhaps no surprise by now to learn that solutions to (4.26) depend on the dimen- 
sion d. Our main concern was with the fate of the Goldstone bosons in d = 2. Here 


1 A + pe 1 
l = 4.2 
gr °° ( L? ) e2N 20 


the integral gives us 


If we start with a weakly coupled theory in the UV, eN <« 1, then we can self- 
consistently assume that u < A to find the solution 


pe Ae Ore (4.28) 


This simple formula is interesting for several reasons. First, let’s understand the phys- 
ical interpretation of setting o = —iu? # 0. Referring back to (4.23), we see that: it 
induces an effective quadratic term for n? in the free energy. This kind of term was 
supposed to be prohibited by Goldstone’s theorem, but here we see that it is generated 
—at least in the large N limit — by thermal fluctuations in d = 2 dimensions. This 
means that the Goldstone bosons in d = 2 are no longer gapless. Correspondingly, if 
we compute their correlator using (4.23), we will see that it decays exponentially, with 
a finite correlation length given by € ~ 1/p. 


The second interesting fact about (4.28) is that the dynamically generated scale ju is 


-1/2 has the lovely 


exponentially smaller than the UV cut-off A. Indeed, the function e 
property that its Taylor expansion around x = 0 vanishes at every order in x. This 
means that the gap u will not show up in any order in perturbation theory in eọ. We 


say that it is a non-perturbative effect. 


Although the calculation we presented above is valid for N > 1, it turns out that 
the conclusions hold for all N > 3; that is, for any theory of interacting Goldstone 
bosons in d = 2 dimensions. This means that there is no phase transition for O(N) 
models with N > 3 in d = 2 dimensions. As we lower the temperature, mean field 
theory suggests that we enter an ordered phase with gapless excitations, but this is 
misleading: instead, thermal fluctuations destroy both the order and gapless modes. 
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The discussion above carries over directly to quantum field theory, where non-linear 
sigma models in d = 1+1 dimensions are also of great interest. Here the interpretation 
of the calculation is that the Goldstone modes — which appear to be massless in the 
classical action — get a mass due to quantum effects. If one didn’t think carefully about 
the meaning of quantum field theory this appears miraculous because the sigma-model 
in d = 1+ 1 dimensions has only a dimensionless coupling e. Yet somehow, the theory 
generates a mass out of this dimensionless coupling, a phenomenon that is known as 
dimensional transmutation. The reason that this is mathematically possible is because 
a quantum field theory, like is statistical counterpart, is not defined by the classical 
action (or free energy) alone. It also requires a UV cut-off A. And, as we see in (4.28), 
it is this UV cut-off which provides the dimensional scale for the mass. 


Finally, I should mention that if you can do a calculation like the one above for Yang- 
Mills in d = 3+ 1 dimensions (or, indeed, in d = 4 dimensions) then fame and fortune 
awaits. The massless gauge bosons that appear in the classical action are strongly 
believed to get a mass through quantum effects, yet this remains to be proven. This 
is the famous “Yang-Mills mass gap” problem. The O(N) sigma model ind =1+1 
dimensions provides a useful analogy for how this might happen. 


Large N ind> 2 


We can also ask if our large N analysis can shed any light on the Wilson-Fisher fixed 
point in d = 3. (Or, if you’re willing for dimensions to wander, in 2 < d < 4.) Here we 
find something interesting. The saddle point equation (4.26) has different behaviour 
for 2 < d < 4 and for d > 4, 


i S a d'k 1 AN da (4.29) 
BN J (mik +u? | A52- pt? Qed 


where we haven’t been careful about the coefficients in front of either term, except to 
stress that the second term comes with a negative sign relative to the first. (We also 
analysed the behaviour of this integral in (2.13), but there only kept the leading term.) 
This equation now has rather different behaviour than the corresponding equation 
(4.27) in d = 2. In particular, when the theory is weakly coupled at the cut-off scale, 
in the sense that 


eeN aA 


there are no solutions to (4.29) for u?. In this case, one finds that the saddle point of 
the free energy actually arises when n gets an expectation value. In other words, it’s 
reconfirming our expectation that the low-energy physics is that of Goldstone bosons. 
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In contrast, as the theory becomes more strongly coupled at the cut-off scale, there 
is a critical value 


eN ~ A2-4 
at which solutions to (4.29) for u start to appear. 
As in the previous section, we identify the deviation from e? with the temperature, 
T= Tnne e 


We can then ask how the correlation length € ~ 1/u diverges as we approach this 
critical coupling from above. Here the story is different for 2 < d < 4 and d > 4, 
because of the different behaviour of the subleading term in (4.29). For 2 < d < 4, we 
have 


T-T E 


which gives the critical exponent 


1 


"= g= 


Rather wonderfully, this agrees precisely (for all 2 < d < 4) with the result of our 
d = 2 + € expansion (4.22), and with the large N limit of our result from the d = 4 — € 
expansion (4.7). Indeed, this result is exact in the N — oo limit and can be used as 
the starting point for a 1/N expansion 


Meanwhile, when d > 4 we can read off the behaviour from (4.29); we have 


1 
Tae. > v=- 
2 
This, of course, is the mean field value that we expect. 


4.4 The Kosterlitz-Thouless Transition 


The Mermin-Wagner theorem means that any system with a continuous symmetry has 
no ordered phase in d = 2 dimensions. As we saw in the previous sections, for the 
O(N) model with N > 3, the would-be Goldstone modes are interacting and become 
gapped as a result of the thermal fluctuations. This means that these models do not 
exhibit a phase transition as the temperature is lowered. 


= 25- 


However, the results of the previous section do not hold for the XY model with 
N = 2. In this case, the sigma-model coupling does not run, and the system remains 
gapless at low temperatures. As we will now see, the resulting physics is rather more 
subtle and interesting. 


The first surprise is that the d = 2 XY model does exhibit a phase transition as 
the temperature is lowered. However, it is somewhat different from the kind of phase 
transitions that we have met so far. In particular, as we saw in Section 4.2.3, thermal 
fluctuations mean that there can be no spontaneous breaking of continuous symmetry 
in d = 2 and, correspondingly, there is no local order parameter that distinguishes the 
two phases. Instead, that task falls on the correlation function. 


In the high temperature phase, we work with the complex field w. The free energy 
has a quadratic term ju?|7)|? and, as we’ve now seen many times (starting in (2.29)) the 
correlation function decays exponentially 


eaT/é 


(yi (x)y(0)) = T (4.30) 


with € ~ 1/u?. In contrast, in the low temperature phase we have u? < 0 and, as 
we described in Section 4.2, we can write y = Met’, with the long distance physics 


dominated by 6. To leading order, we can write the free energy as 
Fé] = afer (ve)? (4.31) 
e 


Very low temperatures correspond to e? < 1. 


The correlation function for this Goldstone mode exhibits a log divergence (4.9), 


2 


e 
(0(x)0(0)) = -57 log(Ar) 
To compare to (4.30), we should look at 


(6-790) e100) — (e7ilO-00))) — 9 ((0(0)-0(0))?)/2 


where the final equality follows because we are dealing with a Gaussian theory (4.31) 
and so can employ Wick’s identity (3.34). We learn that, in the low temperature phase, 
the correlation function for the XY model takes power-law form 


—10(x) ,10(0)\ 
(e ev’) = (4.32) 
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where the anomalous dimension 7 is given by 


Note that this power-law does not occur just at a critical point, but for a range of 
temperatures. As we increase the coupling e?, which is equivalent to increasing the 
temperature, the anomalous dimension increases. We see that the coupling e? in the 
XY model (which can be traced to the u? < 0 coupling in the original theory) is 
something rather rare: it is an example of a genuinely marginal coupling. 


The correlation function exhibits two different behaviours in the high temperature 
(4.30) and low temperature phases (4.32). This suggests that there may be a phase 
transitions between them. The fact that the order parameter for this phase transition 
is non-local — it involves the position of fields at two distinct points rather than one — 
is our first hint that this phase transition has a slightly different smell from others. As 
we will now see, this is not the only thing that sets it apart. 


4.4.1 Vortices 


The mechanism for the phase transition can be found within the sigma model approach 
(4.31), but involves something a little novel. The novelty arises from the fact that, in 
contrast to the Ising field (x) that we worked with in Section 3, the field (x) is 
periodic. There can be field configurations, localised around a point x = X, in which 
(x) winds some number of times, 


$ VO- dx = 2mm withn € Z 


Crucially the winding number n must be an integer so that 0 comes back to itself up 
to a 2r shift. A configuration with n = 1 is referred to as a vortex; when n = —1, it is 
an anti-vortex. These are examples of topological defects. The configurations of lattice 
spins that correspond to a vortex and anti-vortex are shown in the figures. 


At the location of the vortex, x = X the field 6(x) is not well defined. One way to 
proceed is to revert to the original XY model and allow the magnitude of Y to vary 
close to the core. However, for our purposes it will suffice to do something simpler: we 
just admit ignorance on short distance scales, and say that the vortex has some core 
size which we denote as a. This will later play the role of the UV cut-off in our system. 
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Figure 37: A vortex... Figure 38: ... and an anti-vortex. 


We'll start by giving a rough and ready derivation of the effect of vortices. A config- 
uration with winding n has V6 = 5 (y, —a), and so free energy 


Fyortex = = f Pax (V0)? = ae log (<) A> Fae (4.33) 
2e? e? a 

where, in addition to the UV cut-off a, we also need to place the system in a finite size 

L to avoid a long-distance divergence in the energy. We’ve also included a contribution 

from the vortex core region r < a which depends on the microscopic details. Note 

that the free energy of multi-vortices, with |n| > 1, scales as n? and so is energetically 

disfavoured. For this reason we focus on configurations with n = +1. 


The logic now is very similar to the story of domain walls in dimension d = 1 that 
we met in Section 1.3.3. The probability of a vortex configuration arising in the system 
is enhanced by the fact that it can sit anywhere; this gives an extra factor of (L/a)?. 


We then have 
L 2 e7 Frortex e7 Fore L 2=r/e? 
t = = 
p(vortex) (=) 7 z (=) 


We see that, when e? surpasses a critical value, 


E Sekn i (4.34) 
then there is no suppression of vortices; their entropy, coming from the fact that they 
can sit anywhere on the plane, wins out over their energetic cost. As in the previous 
section, e? can be viewed as the temperature of the system, and eẸĝyp translates into 
a temperature scale Tgr, above which vortices proliferate. This, it turns out, is re- 
sponsible for the change in the behaviour of the correlation function, with the vortices 
randomising the phase 0, destroying the delicate power-law fall off (4.32). 
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This phase transition, driven by proliferation of vortices, is known as the Kosterlitz- 
Thouless transition, and is important both for superfluid films, and for the melting of 
two-dimensional lattices, where the defects play the role of vortices. It is also known as 
the BKT transition, as the Russian theorist Berezinskii was the first to appreciate that 
such a transition is possible, although he didn’t fill in all the details. It is sometimes 
referred to as a topological phase transition, because it is driven by the proliferation of 
topological defects. 


Michael Kosterlitz and David Thouless are both Brits, educated in Cambridge, who 
subsequently moved to the US. In fact, Thouless was the first Director of Studies of 
physics in Churchill College. They shared the 2016 Nobel prize in physics for their 
work on this transition. 


A Coulomb Gas of Vortices 


The quick discussion above shows that vortices proliferate when e? gets too large. But 
we can do better. The first step is to appreciate that what really emerges as we increase 
the coupling is a gas of vortices and anti-vortices. The Kosterlitz-Thouless transition 
is better thought of as an unbinding of vortex-anti-vortex pairs. 


To see this in more detail, we will first look at the interactions between vortices. To 
this end, it’s useful to think in terms of the vector field 


v= V0 (4.35) 
In the context of superfluids, this is the velocity field. The equation of motion for 0 is 
Vo=0 > Vev=0 (4.36) 


A smooth vector field defined by (4.35) would obey V xv = 0. However, in the presence 
of vortices, the 0 field admits singularities and, correspondingly, the velocity field obeys 


Vxv=2r2 Son 6°(x — X;) (4.37) 


i=1 


where z is the unit vector out of the plane, and n; = +1 determines the charge of the 
vortex at position x = X. 


We can perform a change of variables to transform (4.36) and (4.37) into more familiar 


E v 
Ei = €;jUj > i = 2 
Es —v1 


= 


equations. We define 


and the equations of motion then become 


VxE=0 and V-E=2r) nj6°(x — X,) (4.38) 


These are the Maxwell equations for the auxiliary electric field E, with the vortices 
acting as “electric charge”. This means that we can import our machinery from our 
course on Electromagnetism; the only difference is that our electric field lives in d = 2 
spatial dimensions. For example, to determine the interaction between two vortices, 
we need to solve the Gauss’ law equation in (4.38). We do this by writing E = —Vy, 
where 


=e log (= wales ‘) (4.39) 


The free energy (4.31) can be expressed in terms of the electric field as 


1 
= fez DER | d's = aa Va 


This looks very similar to our starting point (4.31), except that the relationship between 
the original field 0(x) and the new field x(x) is given by 0:0 = eijðjx, which is not 
straightforward to solve. However, it is now straightforward to compute the free energy. 
First integrating by parts, we have 


F= |ds - xy: B= F min, tog ( Ž EH) + te 
iAj 


where, to get the second equality, we’ve substituted in the expressions (4.38) and (4.39) 
and, for the cases i = j, replaced our expression with the energy of the core of the 
vortex. We learn that the interaction between vortices grows logarithmically. This is 
the Coulomb force in d = 2 dimensions; it is repulsive for vortex pairs, and attractive 
for a vortex-anti-vortex pair. 


We can now use this expression to write down an expression for the partition function 
of the XY sigma model. There are two contributions. To isolate these, we decompose 
the velocity field as 


V = Vsw + Vortex 
The first of these obeys 


V X Vey = 0 
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This is circulation-free flow in the absence of vortices. It describes the contribution from 
the fluctuations of 0: we call these “spin waves”. The second contribution comes from 
vortices and obeys V - Wyortex = 0. The free energy, and hence the partition function, 
then factorise into two 


Z= Zsw Zyortex 


The spin wave piece is harmless; it shows no sign of a phase transition. Meanwhile, the 
vortex piece contains contributions from all number of vortices and anti-vortices. We 
restrict attention to configurations that have equal number of vortices and anti-vortices, 
as these don’t suffer the IR divergence (4.33) in their free energy. We’re left with 


T X; — xX; 
Zvortex — =1 RI 2 p I [exrex i exp (z X nin; log (551) (4.40) 


ifj 
where the initial 1 comes from the configuration with no vortices, and the y = e~ fee /a? 


can be thought of as the fugacity of vortices. Here Xj denote the positions of p vortices, 
and X; the positions of p anti-vortices. Meanwhile, the argument of the logarithm 
involves the sum over the separations X; — X;, i Æ, j of all 2p (anti)-vortices, regardless 
of their charge. Finally, the integral should be taken over all |X; — X,| > a so that the 
cores of vortices do not overlap. 


Zvortex 1S the partition function of a neutral Coulomb gas in the grand canonical 
ensemble, with the + charges interacting through the 2d Coulomb force. 


We would like to understand the phase structure of Zyortex as the coupling e° is 
varied. There are different ways to go about this. One possibility is to implement the 
RG directly on Zyortex. This proceeds by integrating out the vortices that are separated 
on by some short distance scale a, effectively increasing the UV cut-off scale a. Here we 
will take an alternative approach. We will first map the Coulomb gas to a seemingly 
very different problem, one which will be more amenable to the traditional RG methods 
that we’ve been using in this course. 


4.4.2 From Coulomb Gas to Sine-Gordon 


The Coulomb gas (4.40) lies in the same universality class as the so-called Sine-Gordon 
model. This is a theory of a real scalar field $(x), with free energy, 


= I ar Livo — \cos(8¢) (4.41) 


The name is a physicist’s version of a joke: it is a play on “Klein-Gordon” theory! 


'1Sidney Coleman has a famous paper on this model which starts with the sentence “The Sine- 
Gordon equation is the sophomoric but unfortunately standard name for...”. 
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We start by giving a quick derivation of the equivalence between the Sine-Gordon 
model and the Coulomb gas. We will be fairly heuristic. It turns out that this mapping 
is somewhat simpler if we revert back to a spatial lattice, rather than working in the 
continuum. 


To this end, we introduce a lattice with spacing a with lattice sites X,. On each 
lattice site, we include a variable V, which can take values V, = —1,0,+1. The 
interpretation is that if V, = +1, there is a vortex at this site; if V, = —1 there is 
an anti-vortex; and if V, = 0 the site is empty. We allow V,, to only take these three 
values to reflect the fact that two vortices feel a large repulsion, which means that they 
effectively have a hard core, while a vortex and an anti-vortex annihilate to nothing if 
they come too close. 


The grand canonical partition function (4.40) can then be rewritten as 


vertu > exp (z >D Va Va log (= Xa- žl) = ` VR) (4.42) 


{Va} a#B 


We restrict the sum {Va} to configurations that are neutral, so >, Va = 0. This mimics 
the sum over all numbers and positions of vortex-anti-vortex pairs. 


To proceed, we will use the fact that the log that appears in Zyortex is the Green’s 
function for the 2d Laplacian V?. In general, we have 


[Pe ex (— f Ee 5170)? + foo) ~ exp (= f edy fog- yit) 


-1/2 which gives an unim- 


where we’ve dropped a factor of the determinant det(—V7) 
portant overall contribution to the partition function. Using this, the partition function 


(4.42) can be rewritten yet again as 
271% 
Zvortex ~ 5 fos exp (- fees ~(V¢)* a a a n= VF, Fan) 
{Va} 


where we’re using a slightly unholy mix of continuous notation and discrete notation. 
You should think of ġa = ¢(Xq) as the value of $(x) at the lattice site, and write your 
preferred discretised version of the kinetic term. Now we can do the sum over the Vz; 
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we have 


Zvortex = fo exp (-; fee (vo) I] > e T Vabo—Va Feore 
= |v exp (-5 pe (vo?) I] f + 2e Feore cos(2r¢a/e)| 


N [Pee (-; fee (Vo)? + sete | da cos (=)) 


This is the Sine-Gordon model, as promised. Although our derivation used an under- 
lying lattice, the final result is expressed as a continuum field theory, and this is the 
form we will use moving forward. As always, however, the memory of the lattice will 
remain in the UV cut-off scale a. The dictionary between the couplings in (4.41) and 
those of the original XY-model are 


2e Fore 2T 


A=—— and f= — 
e 


ae 
We will now see how these couplings fare under the renormalisation group. 


4.4.3 RG Flows in Sine-Gordon 
We apply our standard RG programme to the Sine-Gordon model, 


F= fex Evo — Ao cos( Bog) 


where we’ve added the subscript 0 to reflect that fact that this free energy is defined 
at the cut-off scale A. 


What follows next is familiar. We work in Fourier space and decompose the field ġ 
into low and high momentum modes, 


Pk = Pk tE 


where œ} includes all modes in the momentum shell A/¢ < k < A. We also define ¢7 (x) 
and t(x) in real space as the inverse Fourier transform of ¢, and ¢ respectively. 


We then integrate out the high momentum modes to leave ourselves with an effective 
free energy, 


F'I97] = Folo] — log (e497) (4.43) 
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where 
Folo] = je (Vo) and Frle] = -> f da cos(So¢) 


and the expectation value reflects the fact that we’re integrating out the fast momentum 
modes, weighted with 


(e-Frle +04) — f Dot e-Folio") e- Filo +0") 
Our goal is to compute this effective free energy. 


First Order in Ao 


We will assume that is suitably small, so that leading order term is 
(exp (o fè cos (8(¢7 + 6) me1+ no f èz (cos (8(¢" + ¢*))), 
+ 
= Ao fp iBop” | piBoot* 
=145 fer De (e A 


Meanwhile, we can use our handy Wick identity (3.34) to write 


(Jer — e782 (b+ 0T (x))+ 
+ 


The propagator for the fast mode, evaluated at the same point, is 


A 2 
OO) = | oop = RE (4.44) 


The upshot of this calculation is that the interaction term is renormalised after inte- 
grating out the high momentum modes, and becomes 


1 2 
FI] = f dx 500")? — roo #/ c0s(36-) 
and the coupling Ao becomes 
N = MEEA 


The next step of the RG is the rescaling, x + x’ = x/¢. It’s simple to check that the 
rescaling of the field is trivial. With this, our free energy becomes 


P| = f dx ZVA -AC eos( 0) 
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with 
MOH r (4.45) 


Already we can see the essence of the Kosterlitz-Thouless phase transition in this 
equation. When £ is suitably large, 
T 
B>8& > < z 
then the effect of RG is to reduce A. This means that the coupling Acos(8¢ġ) is an 
irrelevant operator and, as we flow towards the infra-red, A —> 0. In this case, the 
free energy for ¢ is given just by the gradient terms, and the correlation function will 
exhibit a power-law fall off. 


In contrast, when £ is small, 

T 

2 

the operator Àcos(8ġ) becomes relevant, growing as we go towards the infra-red”. 
Now the minimum of the potential is at ¢ = 0 mod 27/2. Expanding the cos potential 


B<8m# => e> 


about this minimum gives a quadratic term for ¢ and correlation functions will now 
be exponentially suppressed, with a finite correlation length. It is perhaps surprising 
that our sophisticated RG analysis gives exactly the same value for the critical coupling 
g2 = 7/2 as our previous, hand-waving discussion of vortex proliferation (4.34). 


Using our earlier result (4.32) for the anomalous dimension, we see that at the phase 
transition, e? = 7/2, the system exhibits a universal anomalous dimension, 


1 


(ered ge) R AR 


Second Order in Ao 


At order AZ, we find ourselves with the double cosine 


(cos(S(¢, + $£)) cos(B(dy + ¢F)))+ = : > a (ei BRO) 


o=+1 


4 eP -ty) (eil) | 


1 z ECEN +t 
= 5 [east A(de + gp) PASP 


+ cos A(z — oy) e P] 


12This same result is derived in a very different way, using conformal field theory, in the lectures 
on String Theory. It can be found in Claim 2 of Section 4.3.3 where, in the notation of that course, 
the operator e’** is shown to have dimension A = a/k?/2. A quick check of the conventions for the 
propagator shows that we should set a’ = 1/27 so that A < 2 and the operator is relevant if k? > 87. 
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where, as a space-saving measure, I’ve put the spatial position as a subscript, (x) = x. 
Taking the logarithm in (4.43) means that we subtract the disconnected diagrams, 
(cos(G(¢, + ¢£)))2.. The upshot is that, at order Xj, the effective free energy includes 
the piece 


AG + + 
Fle ]|2 = $ f Pady { cos(3(ox + ¢5)) eae een _ Sere ete 


+ cos(G(¢, — ¢y)) eae ws — 6B? /2Ubx bx)+ e—B?/2(by OF )+ 


The expectation values that sit in the exponents are given by 


A k elk (xy) 
cee-vid rms e 


Previously we needed only G(0; ¢) = + log ¢; now we see that the correlator at spatially 
separated points also arises. We have 


FO Ma = pac | Pady {collez 4) [e -1 
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At first sight, we seem to have a non-local free energy involving a double integral. To 
massage it into something more familiar, we need to realise that the function G(r) 
receives contributions from a small sliver of Fourier modes, and so decays quickly at 


distances r > ¢/A. This means that the functions [e*?¢-¥) — 1] are non-zero only in 
a small window |x — y| ~ ¢/A. 


We write y = x + v and Taylor expand the cos factors in the integral for small v. 
For the first, we have simply 


cos(3(o, + Qy )) ~ cos(26¢,) 


The second is more interesting for us; we have 


2 
cos(8(dx — &)) = 1- ver? 
Our free energy, at order \?7, then becomes 
pe 1, 2 2 
F"| E = xfa ZENG) cos(28ġ) + A2(¢)(V¢) \ + const. (4.46) 
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where all the messy details have been absorbed into two functions 
ayo term fie [ereo 
1 
Axl) = ~F0-P np f do v? [etso — 1] (4.47) 


We see that the RG flow has generated two terms for us in (4.46). The first, cos(28¢), 
is something new: it can be viewed as the effect of two vortices, which we didn’t 
include in our original Sine-Gordon model but is generated upon integrating out high 
momentum modes. We will not need this here. The second term is something familiar: 
it is renormalisation of our kinetic term. The final steps of the RG procedure tell us 
to rescale space, x — x’ = x/C, but also rescale the field ¢ so that the kinetic term 
remains canonically normalised, as in (3.6). We have 


P(x’) = V1 + à542(0) P(X) 


This rescaling gets spat out inside the potential, where it has the effect of renormalising 
our other coupling, 8. We have 


BO) = Bo (1+ XAO) = Bolt — £4840) (4.48) 


Recall that, in terms of our original XY model, 8? = 4r?/e?. Looking back to our XY 
sigma model (4.31), we see that it renormalises the 1/e? coefficient of the kinetic term. 
This is sometimes called the spin wave “stiffness”, since it measures how difficult it is to 
twist the spins. The intuition behind the result (4.48) is that a gas of vortex-anti-vortex 
pairs screens the spins, reducing their stiffness. 


Beta Functions 


Our task now is to understand the global properties of the resulting RG flow. We write 
down the beta function in terms of ¢ = e°. From (4.45) we have 


Meanwhile, from (4.48), we get 
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where C(8) > 0 is a positive function that we could extract from the formula (4.47); 
it’s exact value will not be important for us. Written in terms of e? = 47?/8?, this 
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Figure 39: RG flows for the KT transi- Figure 40: RG flows, zoomed in at the 
tion. critical point. 


latter RG equation becomes 
a = 8r’ On 
ds 
The global structure of the RG flow is shown in the figure. To get a sense for this, 
first note that if \ is small, and e? < 7/2, then À will be rapidly driven to zero; this 
is the low-temperature phase in which correlation functions drop off with a power-law. 
Meanwhile if e? >> 7/2 then will be pushed large. This is the high temperature 
phase, with a non-vanishing correlation length. In the high temperature phase, we see 
that e?(s) — œo as s 4 oo. Meanwhile, in the low temperature phase e?(s) is finite as 
s — oo. At the transition, there is a jump 
ij 2 
T 


a 
e s—> 00 


To see the larger picture, it’s best to zoom in to the phase transition itself. We define 
r=°-3 and y=v8r?C A 


For x and y small, the beta functions become 


dx J dy 
kE d 2x 
ds M oe ds i 
From this we can compute 
dx? dy 2 d 
ds H R ge ae my) 


In other words, close to the critical point, the flows are hyperbolae. A general flow 
can be written as 


r? — ny? = J = x? — ry 
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where (£o, Yo) are the initial “bare” values of the couplings at the cut-off scale. There 
are two regions with J > 0; these correspond to the low and high temperature regimes 
that we discussed above. The separatrix at J = 0 is given by x = +,/7y. The line 
with x = —,/zy flows directly to the critical point. The line with « = +,/7y flows 
away from the critical point. 


Suppose that we start on the left of the figure, with « < 0. Then the initial data, 
shown as a dotted line in the figure, is (£o, yo). As we vary this data, we pass through 
the phase transition. In this sense, it is natural to think of this initial data as a function 
of the temperature (xo(T), yo(T)), with 


JT) ~T- T 
ensuring that we hit the critical point when J = 0. 


Finally, we can ask about the correlation length €. To compute this, we can look at 
flows with J < 0 which don’t quite hit the y = 0 axis. We have 
dx z l 


1 
Z Syp = (0? J) = (0? I) 
ds T T 


which we can solve to give 


gale" (a) e a) 


This has the slightly odd property that z — oo as s remains finite. This is an artefact 
of our approximation but, nonetheless, can be used to our advantage. By the time 
x & 1, we also have y ~ 1 and the theory is in the gapped phase. We can stop the RG 
flow here and use this as a proxy for our correlation length which, as we approach the 
phase transition from above, scales as 


1 
E ~ ae? ~ exp (sa) ~ exp (a=) 


We’re are used to a fairly soft divergence in the correlation length as we approach 
the critical temperature. For the Kosterlitz-Thouless transition, the change is much 
more dramatic. This also has an effect on the thermodynamic free energy which, as 
an extensive quantity scales as F ~ (L/€)?. As we approach the phase transition from 
above, we have 


1 1 
Fihermo ~ e ~ exp (-- 7) 
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This is a very weak singularity. There is no discontinuity in the heat capacity. Moreover, 
there is no discontinuity in any derivative of the free energy. In terms of Ehrenfest’s 
original classification, the Kosterlitz-Thouless transition is rather strange: it is a phase 
transition of infinite order. 
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